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Abstract 


The  2005  GFD  program  was  entitled  "Fast  Times  and  Fine  Scales"  with  a  focus  on  asymptotic 
and  stochastic  modeling  methods  that  exploit  a  physical  scale  separation  of  some  kind.  An 
extremely  strong  application  pool  resulted  in  the  appointment  of  the  unusually  large  class  of 
eleven  GFD  Fellows  for  the  summer.  The  first  week  consisted  of  principal  lectures  from  Joe 
Keller  on  waves  in  fluids,  ray  methods,  and  a  variety  of  applications.  The  second  week  was 
divided  between  Eric  Vanden-Eijnden's  lectures  on  Brownian  motion  and  stochastic  differential 
equations,  and  George  Papanicolaou's  lectures  on  variational  principles  and  asymptotic  methods 
in  homogenization  theory.  The  principal  lectures  were  particularly  well-attended  but  the  lecture 
room  at  Walsh  Cottage  proved  up  to  the  task  of  accommodating  the  full  audience. 

Research  lectures  by  staff  and  visitors  were  delivered  daily  throughout  the  program  addressing 
topics  ranging  from  applications  of  multiscale  modeling  methods  in  ocean  and  atmosphere 
dynamics,  to  applications  of  stochastic  methods  in  populations  dynamics  and  chemical  kinetics, 
to  applications  of  homogenization  theory  in  materials  science  and  engineering.  The  program 
also  included  a  popular  public  lecture  on  the  timely  subject  of  tsunamis.  As  usual  this  summer 
ended  with  the  Fellows'  reports  including  two  experimental  projects  and  theoretical 
work  on  a  variety  of  problems  inspired  by  the  summer's  research  theme. 

Oliver  Biihler  and  Charlie  Doering  acted  as  co-Directors  for  the  summer.  Janet  Fields,  Jeanne 
Fleming  and  Penny  Foster  provided  the  administrative  backbone  for  the  program.  Keith  Bradley 
supplied  technical  support,  and  Matt  Finn  ran  the  computer  network  and  graciously  helped  with 
the  production  of  the  summer's  proceedings  volume.  As  always  we  are  grateful  to  Woods  Hole 
Oceanographic  Institution  for  the  use  of  Walsh  Cottage,  the  perfect  setting  for  the  GFD  program. 
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Lecture  1:  Fluid  Equations 

Joseph  B.  Keller 


1  Euler  Equations  of  Fluid  Dynamics 

We  begin  with  some  notation;  x  is  position,  t  is  time,  g  is  the  acceleration  of  gravity  vector, 
u(x,t)  is  velocity,  p(x,t)  is  density,  p(x,t)  is  pressure.  The  Euler  equations  of  fluid  dynamics 
are: 


Pt  +  v  •  ( pu )  =  0 

Mass  conservation 

(i) 

p[ut  +  ( u  ■  Vu)]  =  Vp  -  pg 

Momentum  equation 

(2) 

P  =  P(P,  S,  T ) 

Equation  of  state. 

(3) 

Here  T—  temperature  and  S=  salinity  in  the  ocean  or  humidity  in  the  atmosphere.  (l)-(3) 
represent  five  equations  for  five  unknown  functions  (p,  u ,  p).  We  assume  T,  S  are  given. 

Note 

(a)  (l)-(3)  hold  in  air  with  the  ideal  gas  law 

(b)  (l)-(3)  hold  in  water  with  the  equation  of  state  for  water 

(c)  instead  of  (l)-(3),  the  equations  of  solid  mechanics  (elasticity,  plasticity)  hold  in  the 
interior  of  the  Earth 

(d)  certain  conditions  must  hold  at  the  interfaces  between  air  and  water,  water  and  solid, 
where  the  solutions  are  discontinuous 

1.1  Boundary  conditions 

To  study  the  motion  of  the  water  and  its  upper  surface  we  assume  that: 

(a)  the  location  of  the  bottom  surface  is  known: 

z  =  - h(x,y,t ) 

(b)  the  pressure  in  the  air  at  the  top  surface  is  known: 

Pair(x ,  y,  t )  =  patr[x,  y,  T)(x,  y,  t),t}. 

At  the  top  surface  z  =  rj(x,  y,t);  ?y  is  to  be  determined. 

•  Kinematic  condition 

At  each  free  surface,  the  normal  velocity  u  •  v  of  the  fluid  in  the  direction  of  the  unit 
normal  v  is  equal  to  K,  the  normal  velocity  of  the  surface. 


1 


Bottom: 


{hX'hy, 1) 
\/hi+hy  +  l 


uhx  +  vhy  +  w  =  —ht 


-urjx  -  vrjy  +  w  =  r)t 


y  —  ,  ~fet 
at  z  =  —h(x,y,t) 


at  2  =  r?(x,  y,  t ) 


(4) 

(5) 


•  Dynamic  condition 

At  z  =  77(2;,  y,t),  the  pressure  in  the  water  equals  the  pressure  in  the  air  (ignoring  sur¬ 
face  tension): 


p[.r,  y,  r/(x,  y,  t),t]  =  pmr(x,  y,  t )  at  2  =  rj(x,  y,  t ) 


(6) 


Note 

(a)  one  condition  at  the  bottom  where  h(x.y.t)  is  known. 

(b)  two  conditions  at  top  where  rj(x,y,t)  is  unknown. 

1.2  Initial  conditions 


p(x,0)  =  B(x)  (7) 

u(x,0)  =  U(x)  (8) 

=  N(x,y)  (9) 

To  solve  the  initial  boundary  value  problem  for  motion  of  the  water  we  must  find  p{x,t), 

p(x y(x,y,t)  satisfying  (l)-(9)  given  g ,  an  equation  of  state,  T(x:t),  5(a;,£), 
h(x,y,t),pair(.x,y,t),  H(x).  U(x),  N(x,y). 

1.3  Hydrostatic  equilibrium  in  a  horizontally  stratified  ocean 

Suppose 

p0/ir  =  consiant  =  po?  h  —  h{x,  y),  T  =  T(^),  5  =  5(^).  (10) 


Then  a  solution  to  the  Euler  equations  is: 

u  —  0.  1]  =  0.  p  =  p{z ),  p  =  p{z).  (11) 

Equation  (1)  is  satisfied  as  are  the  boundary  conditions  (4),  (5)  and  the  x,  y  components 
of  (2).  The  z  component  of  (2)  becomes: 

^  =  -9pW)X{z),S{z)},  z  <  0  (12) 

with  boundary  condition 


p(  o) = po¬ 


rn 
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The  solution  of  (12)  and  (13)  is  called  the  hydrostatic  pressure. 

To  solve  for  p(z )  we  need  an  equation  of  state.  An  approximate  equation  of  state  for 
seawater  is: 


p  =  po  —  aT  +  PS,  po,  a,  P  =  constants  >  0. 

The  solution  of  (12)  and  (13)  is: 

p(z)=JH>-9  Poz  +  a  [  T{z,)dzf  —  (3  [  S(z')dz'  , 

Jo  Jo 


z  <  0. 


(14) 


(15) 


The  solution  (11)  solves  the  initial  boundary  value  problem  when  the  initial  values  (7)-(9) 
are  also  also  given  by  (11).  How  do  we  find  solutions  for  different  initial  and  boundary 
data? 


1.4  Perturbation  method  for  solving  problems 

Consider  an  equation  depending  on  a  parameter  e 

F(u,e)  =  0 .  (16) 

The  unknown  u  may  be  a  function  of  x,t  or  a  collection  of  functions  like  p,  u ,  p,  p,  and  F 
may  be  a  set  of  equations  like  (l)-(6)  with  initial  conditions  (7)-(9).  The  solution  of  (16), 
u(e),  will  depend  upon  e .  If  it  is  a  smooth  regular  function  of  c,  we  can  expand  it  in  a 
Taylor  series: 

u(e)  =  u(  0)  +  eu(0)  +  0(e2)  (17) 


u  = 


du(e) 

“dTUo 


Suppose  that  we  can  solve  (16)  when  e  =  0,  i.e.  we  know  u(0).  Then  we  can  differentiate 
(16)  with  respect  to  e  and  set  e  =  0: 


Fu[u{0),0]u(0)  +  Fe[u{0),0)=0.  (18) 

If  we  solve  (18)  for  ft(0)  then  (17)  will  give  u(e)  with  an  error  0(e2).  A  better  approximation 
can  be  obtained  by  keeping  the  term  yii(0)  and  differentiating  (16)  twice  to  get  an  equation 
for  ii(0)  etc. 


This  is  the  regular  perturbation  method,  and  e  is  the  perturbation  parameter.  Equation 
(18)  is  called  the  variational  equation  of  (16).  It  is  linear  in  the  unknown  it(0),  so  it  is  called 
the  linearized  equation  (linearized  about  ^(0)).  It  can  also  be  derived  by  substituting  (17) 
into  (16)  and  expanding  in  powers  of  e. 


Note  that  e  can  be  any  parameter,  or  set  of  parameters,  which  occurs  in  one  or  more 
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of  the  differential  equations  or  in  the  boundary  or  initial  conditions.  The  linear  operator 
Fw[ix(0),0]  in  (18)  is  always  the  same.  Only  the  inhomogeneous  term  Fc[u(0),0]  depends  on 
what  the  parameter  is. 

Here  are  some  examples  of  how  we  can  perturb  hydrostatic  equilibrium. 

Let  po,  Iiq(x,  y),  po  be  the  equilibrium  solution,  and  then  set 


Pair  =po  +  m  (x,  y,  t ) 

atmospheric  disturbance 

(19) 

h  =  h0{x,y )  +ehi{x,y,t) 

bottom  motion  (earthquake,  landslide) 

(20) 

V(x,t)  =  0  +  ei]i(x,y) 

initial  elevation  or  depression 

(21) 

u,(x,p,0)  =  0  +  eui(x,y) 

initial  motion 

(22) 

p(x,  ?/,  /.)  =  p0{z)  +  epi  (x,  y,  z) 

initial  density  anomaly. 

(23) 

To  get  the  linearized  equations,  we  linearize  about  the  equlibrium  solution.  We  start  with 

(1): 


Pi  +  V(pu)  =  0. 


We  differentiate  with  respect  to  e 


pt  +  V  •  ( pu  +  pit)  —  0. 

We  set  e  =  0  and  then  u  =  0,  p  =  pq.  Hence 

pi  +  v  •  {po{z)u)  =  o.  (i) 

Similarily,  linearizing  (2)- (9)  we  get 

poUt  =  -Vp-  pg  (2) 

P  =  PpiPo,  T,  S)p  +  prT  +  psS  (3) 

uliQx  +  vh()y  +  w  ~  —ht  at  2  =  — /^(x,  y)  BC  at  bottom  (4) 

w  =  iy  at  z  =0  BC  at  top  (5) 

p  +  pzi)  =  p\  at  z  =0  BC  at  top  (6) 

p(.r,  0)  =  pi  at  t  =0  (7) 

u(x ,  0)  =  uj  at  t  =0  (8) 

?)(x,0)  =  7]\  at  t  =0.  (9) 


Equations  (1)  -  (9)  are  the  linear  equations  for  p ,  u.  p,  f]  and  constitute  an  initial  boundary 
value  problem. 

2  Fluids  of  Constant  Density 

Now  let’s  suppose  that-  p  is  a  constant.  Then  we  can  cancel  p  from  the  continuity  equation 
to  get 

V  •  w  =  0. 
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This  condition  states  that  u  is  divergence  free.  The  momentum  equation  becomes 

/  X~7\  1 

ut  +  {u  *  v)u  =  --p  -  g. 

P 

Recall  we  also  had  an  equation  of  state,  relating  the  density  to  temperature  and  salinity. 
Here  this  equation  is  simply  p  =  po5  a  constant. 

Since  the  density  is  constant,  (15)  becomes 

p(z )  =po-  gpoz- 

Linearizing  the  equations  around  a  rest  state  yields  the  linear  system 


V  •  u  =  0 

ut  =  — Vp. 

P 

At  the  beginning  of  this  section  we  assumed  the  fluid  was  of  constant  density,  and  showed 
that  this  implies  a  divergence  free  velocity  field.  If  we  assume  instead  that  when  we  follow 
a  fluid  element  the  density  does  not  change  (rather  than  assume  that  the  whole  fluid  has 
constant  density),  we  get 

Dp  _ 

—  Vp  =  0. 

The  conservation  of  mass  equation  is  then  p(V  •  u)  =  0.  Thus  if  the  fluid  is  incompressible, 
this  also  implies  a  divergence  free  velocity  field. 


Free  Surface 

P,1,r  =  p(p>i  -0,-  |(V<>)2) 
t}t  +  v  •  Vq  =  0 


Bottom 

i/ .  Vb  =  —ht 

Z  =  —  //(.T,  t) 


Figure  1 :  Irrotational  flow  of  an  incompressible  ideal  fluid 
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3  Irrotational  Motion 


Now  let’s  introduce  the  quantity  uj  =  V  x  u,  the  vorticity  of  the  fluid.  If  we  assume  that 
(or  begin  with)  a  fluid  where  this  quantity  is  zero,  then  we  say  the  motion  is  irrotational. 
When  u(x,t)  is  a  curl-free  vector  field,  then  we  can  introduce  a  potential  function 

</>(x,t)  =  I  u(s,t)ds. 

Jx  0 

Here  this  integral  is  evaluated  along  any  curve  C  from  :ro  to  x.  It  is  a  calculus  exercise 
to  show  that  this  potential  is  well  defined  only  when  we  have  an  irrotational  vector  field 
(hint:  use  Stokes’  Theorem).  Notice  now  that  u  =  V0  so  if  an  incompressible  fluid  is  in 
irrotational  motion,  then  conservation  of  mass  gives 

V  •  u .  =  V  •  V<£  =  A <p  =  0. 

Thus  (/)  solves  Laplace’s  equation,  or  in  other  words,  0  harmonic.  Now  consider  the  mo¬ 
mentum  equation 

ut  +  (u  •  V)u  H — Vp  =  W. 

P 

Here  V  is  a  potential  function  for  the  force,  /,  acting  on  the  fluid.  Now  we  can  plug  in 
u  =  V0,  to  get: 

V0t  +  l/2V(V</>)2  +  -Vp-  VF  =  0. 

P 

If  f)  is  a  constant,  we  can  integrate  this  equation,  to  get: 

<P,  +  1/2(V<P)2  +  -p  -  V  =  B(t). 

P 

This  is  called  Bernoulli’s  Equation.  Here  B(t)  is  an  integration  constant,  which  we  can  set 
to  zero.  II'  B  was  not  zero  then  we  could  consider  a  new  system  where  </>  is  redefined  to  be 
k/;  =  (f>  +  j(j  B(s)ds,  and  see  that  %b  solves  Bernoulli’s  equation  with  zero  right  hand  side. 
Since  the  gradient  of  ij)  and  </;  agree,  and  it  is  this  gradient  we  are  interested  in,  we  can 
safely  set  B  to  zero.  Now  we  can  solve  for  the  velocity  and  pressure  by  solving  for  (j)  using 
Laplace’s  Equation,  and  then  solving  for  p  using  Bernoulli’s  equation.  The  boundary  value 
problem  for  (j)  and  ?/  is  shown  in  Figure  1. 

In  Figure  1,  we  have  a  linear  pde  for  (j)  with  a  linear  boundary  condition  at  the  bottom 
boundary.  On  the  surface,  we  have  two  nonlinear  boundary  conditions  imposed  at  a  location 
which  depends  upon  the  solution.  In  linearizing  this  system  about  a  rest  state,  all  the 
equations  stay  the  same  except  those  on  the  free  surface.  There  we  get 

i)t  +  0  ■  V0  +  v  •  V0  +  {v  •  V02)?)  =  0  z  =  0 

pair  _  pg. fj  _  +  1(V0)2  z  -  0. 

Since  the  rest  stale  is  </;  =  0  and  i]  =  0.  tliese  linearized  surface  equations  become 

Vi  -4>z  =  o 

</><  +  9V  =  -  \vmr 
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z  =  0 
2  =  0. 


Now  we  can  differentiate  the  second  equation  and  plug  it  into  the  first  to  get  a  single  linear 
free  surface  condition 

4>tt  +  9<i>z  =  -- -i>TT • 

When  p  and  <fi  are  time  harmonic  with  time  dependence  exp(— iut)  this  becomes 

W2  1  ■ air  In  n 

- a - —  (f)z  =  0  z  —  0. 

9  pg 

The  equations  for  the  fluid  below  the  surface  are 

(fin  = -ht  z  —  -h 

A <fi  =  0  —  h  <  z  <  0. 

These  last  three  equations  govern  time  harmonic  small  amplitude  irrotational  motion  of  an 
inviscid  fluid  of  constant  density. 

Notes  by  Benjamin  Akers  and  Tiffany  Shaw. 
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Lecture  2:  Gravity  waves  and  the  method  of  stationary  phase 

Joseph  B.  Keller 


1  Gravity  waves  on  a  layer  of  uniform  depth 

In  addition  to  the  boundary  conditions  on  the  upper  and  lower  interfaces,  </>  must  satisfy 
both  the  initial  conditions  and  boundary  conditions  on  lateral  surfaces  or  at  infinity.  We 
consider  free  vibrations,  in  which  neither  the  bottom  elevation  nor  the  pressure  at  the  free 
surface  are  perturbed.  Moreover,  we  specialize  to  the  case  of  uniform  depth  We  now 
summarize  the  governing  equations  derived  in  the  last  lecture: 

v2<£  =  0,  — /i.(0)  <  2  <  0  ,  (1) 

subject  to  the  boundary  conditions 

—  0  on  je:  =  —  =  /?</>  at  2  —  0.  (2) 

oz  uz 

Recall  that  </;  represents  the  spatial  variation  of  the  velocity  potential,  and  we  have  defined 
a  wavenumber  (5  =  u;2/#.  We  seek  to  solve  this  system  of  equations  using  separation  of 
variables:  <fi(x,y,z )  =  U(x,y)f(z).  After  this  substitution,  (1)  yields 


U xx  +  U'i 


yy 


U 


The  boundary  conditions  (2)  become 


/'(_/, <°))  =  0  ,  and  /'( 0)  =  (3f( 0)  . 


(3) 

(4) 


In  equation  (3)  since  the  left  hand  side  is  a  function  of  only  x  and  y  and  the  right  hand 
side  is  a  function  of  z  only,  both  must  be  constant,  equal  to  some  —  k2.  f  and  U  then  must 
satisfy 

uxx  +  Uyy  +  k2U  =  0  ,  (5) 

f"  =  k2f  .  (6) 

(5)  is  called  the  Helmholtz  or  reduced  wave  equation.  Solving  for  /  and  applying  the  first 
of  the  boundary  conditions  (4),  we  obtain 

f(z)  =  Acosh[k(z  +  h)}  ,  (7) 
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where  A  is  an  arbitrary  constant,  and  we  have  suppressed  the  superscript  upon  h .  In  order 
to  satisfy  the  free  surface  boundary  condition,  we  need  that  /csinh(fc/i)  =  /?cosh (kh),  or 
equivalently 

u2  =  gktanh(kh)  .  (8) 

This  is  the  dispersion  relation ,  fc  is  the  wavenumber ,  and  u;  is  the  angular  frequency .  It  may 
seem  surprising  that  (8)  only  has  one  positive  and  one  negative  real  solution.  There  are 
infinitely  many  pure  imaginary  roots  k  =  in  to  this  equation,  but  these  represent  evanescent 
modes. 

There  are  two  physically  interesting  special  limits 


•  Deep  water: 

•  Shallow  water 


kh  — *  oo  cj2  ~  gk. 

(9) 

kh  — >  0  J1  ~  ghk2. 

(10) 

Another  useful  concept  is  the  phase  velocity  defined  by 

VoJk  as  kh 
^fgh  as  kh  — ►  0 


_  L 0  „  j  y/gjk  as  kh  — *  oo 


k 


(11) 


Notice  that  to  leading  order  the  phase  velocity  is  independent  of  the  wavenumber  in  the 
shallow  water  limit.  Media  with  this  property  are  said  to  be  non- dispersive. 

We  can  now  solve  for  the  x  and  y  variations  of  the  velocity  field.  Plane  wave  solutions 
of  the  Helmholtz  equation  have  the  form 

U{x)y)  =  etfea3,  where  |fc|  =  k  .  (12) 


Therefore 


<f>(x,  y,  z,  t,  k )  =  Ael(k  x  ^  cosh[fc(2  +  /i)], 


r){x,y,t,k)  =  -  -fa 
9 


=  —Aei{kx~ut)  cosh  (kh). 
z= o  9 


(13) 

(14) 


Finding  the  wavenumber  for  a  prescribed  frequency  to  requires  solving  a  transcendental 
equation  (8).  It  is  easier  to  give  k  and  then  determine  u  directly  from  (8). 


2  Trajectories  of  fluid  particles 

By  further  integration  we  can  obtain  expressions  for  the  motion  of  the  water  particles.  We 
choose  axes  so  that  k  points  in  the  x-direction,  and  consider  the  small  excursions  of  a  fluid 
particle  (x  +  fe(t),  z  +  5z(t)),  where  5x  and  8z  evolve  according  to  the  linearized  equations: 

=  ikAei{kx-wt)  cosh  [k{z  +  h)]  (15) 

=  kAei[kx~ut)  sinh [k(z  +  h)\  .  (16) 


d5x 

dt 

d,5z 

dt 
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Figure  1 :  (Blue  curve)  Superposition  of  two  waves  of  narrowly  separated  wavenumber  and  frequency. 
(Green  curve)  The  predicted  curve  envelope.  Here  5k  =  kj  10. 


On  integrating  up,  and  explicitly  writing  out  the  real  part  (assuming  that  A  is  real  for 
convenience)  we  arrive  at: 

Ah 

Sx(t)  = - cos(kx  —  ut)  cosh[fc(z  +  h)]  (17) 

U 

Ak 

5z(t)  = - sm(kx  —  ut)  sinh[fc(2  +  h)\  .  (18) 

u 

The  particles  therefore  trace  out  an  ellipse  in  the  (x,  z)  plane,  with  horizontal  axis  (Ak/u)  cosh[k(z+ 
h)]  and  vertical  axis  (Ak/u)smh[k(z  +  /?,)]. 

3  Group  velocity 

First,  consider  the  superposition  of  disturbances  having  slightly  different  wavenumbers,  and 
identical  amplitudes: 

<j>±  =  A  sin[(/r  ±  8k)x  —  {u  ±  5u)t]  .  (19) 

The  sum  of  the  two  velocity  potentials  may  be  simply  written: 

=  2 A  cos (Skx  —  5ut)  sin (kx  —  ut)  .  (20) 

The  sin  and  cos  factors  describe  respectively  the  wave,  traveling  at  the  phase  velocity 

c  =  u/k)  and  the  much  slower  oscillations  in  the  amplitude  envelope,  which  propagates 

at  the  group  velocity  cg  =  du/dk  ~  5u/6k.  This  ‘beat’  structure  persists  for  wave  packets 
with  any  small  spread  of  wavenumbers.  The  wave  packet  travels  at  the  group  velocity,  while 
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individual  wave  components  move  through  the  wave  packet  at  their  phase  velocities.  For 
gravity  waves  on  a  uniform  fluid  layer  we  have: 


c  /  2 kh 

C9  =  2  *  +  “ 


sinh(2  kh)') 


Again  we  consider  two  limits  for  (21): 


U 


\c  as  kh  oo 

as  kh  0 


(21) 


(22) 


Now  we  calculate  the  period-integrated  energy  flux  F  through  a  surface  £=const,  with 
unit  width  in  the  y-direction.  This  is  equal  to  the  rate  of  working  of  the  pressure  force 
across  the  surface,  or  since  in  the  linear  approximation  p  =  — pd(f)/dt ,  to: 


(23) 


where  T  =  Note  that  it  is  only  necessary  to  evaluate  the  2  integral  over  the  interval 
~h  <  z  <  0;  the  contribution  from  the  sliver  0  <  z  <  rj  is  of  lower  order.  After  some 
calculations,  we  discover 

TP  A  2  .2 

cosh 2(kh)cg.  (24) 


rt+T 

l^dS 

It 

Js  dt  dn 

Fav  —  j,  — 


F  A2Plj 2 


2  g 


Next  let  us  calculate  the  total  energy  (associated  with  the  flow)  in  a  box  R:  {xo  <  x  <  xo+A, 
yo  <  y  <  yo  + 1,  -h  <  z  <  rj}. 


E  =  pj  J  +  gz}dV  +  gh2  . 


(25) 


The  integral  represents  the  total  (kinetic  plus  potential)  energy  of  the  fluid  contained  within 
the  box,  and  the  additional  summand  is  simply  the  hydrostatic  potential  energy  which  must 
be  subtracted  off.  With  a  little  algebra,  we  find  that 


En 


From  (21),  (24)  and  (26)  we  get 


E  A2pu2 


cosh  2(kh). 
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Fav  —  EavCg  , 

so  that  the  energy  of  the  wave  propagates  with  the  group  velocity  cg . 


(26) 

(27) 


4  Superposition  of  plane  waves 

A  general  solution  of  the  wave  equation  can  be  written  as  a  linear  combination  of  plane 
wave  solutions  of  the  type  considered: 


r?(x,  y,  t)  =  f  d 2k  U(k)  cosh  kh  ei(fc  x-wt)  -  B{k)  cosh  kh  <.<(*•*+«*)) 

J  9 


(28) 
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The  amplitude  functions  A(k)  and  B(k)  can  be  determined  to  make  rj  and  rjt  satisfy  initial 
conditions.  The  fluid  elevation  at  time  t  —  0  is 

rj(X)  y,  0)  =  [  d 2k  cosh  kh  ( A(k )  —  B(k ))  etk  x  .  (29) 

J  9 

Thus  the  Fourier  transform  r )  of  the  initial  fluid  height  is  related  to  A  and  B  by 

0)  =  cosh  kh(A(k)  -  B(k ))  .  (30) 

9 

Similarly  for  the  Fourier  transform  of  the  initial  velocity  of  the  free  surface, 

?/i(fc,0)  =  cosh  kh(A(k)  +  B(k))  .  (31) 

9 

Thus  from  (30)  and  (31)  the  amplitude  functions  are  determined  by: 


Vi  -  iurj |,=0  = 

2a;2 

- cosh  khA(k)  , 

9 

(32) 

Vt  +  Ml /=0  = 

2a;2 

- cosh  khB(k)  . 

(33) 

However  convoluted  the  initial  conditions  upon  the  surface  elevation,  we  can  obtain  the 
complete  evolution  of  the  fluid  surface  height  if  only  we  can  evaluate  these  integrals.  In 
the  days  when  computation  was  expensive,  this  was  a  knotty  problem,  since  the  range  of 
integration  in  wave  number  space  extends  out  to  infinity,  and  the  amplitude  functions  may 
decay  only  weakly  as  the  wavenumber  is  made  vanishingly  small.  In  the  next  section,  we  de¬ 
scribe  a  powerful  asymptotic  method  for  evaluation  of  the  integral  far  from  any  disturbance 
sources. 


5  Asymptotic  evaluation  of  integrals 


We  consider  the  simplest,  case  of  a  one-dimensional  disturbance,  and  seek  to  evaluate  the 
following  integral: 

r]{x,  t)  =  f  A(k)ei{kx-^k^dk  .  (34) 

J  —  OC 

When  x  and  t  are  large,  the  integrand  of  (34)  oscillates  very  rapidly  and  everything  is 
canceled  out.  So  the  dominant  contribution  to  i ?(a;,/)  comes  from  the  neighborhood  of  k  = 
ks,  a  stationary  point  at  which  the  derivative  of  the  phase  function  S(fc,  x,  t)  =  kx  —  u>(k)t 
with  respect  to  k  vanishes.  We  approximate  S  about  k  =  ks  by  a  Taylor  series  expansion: 

S(k)  «  S(ks)  +  Sk(ks)(k  -  ks)  +  £S**(fc)(fc  -  ks )2  +  . . .  .  (35) 


Noting  that  Sk ( ks )  =  0,  we  find  that  (34)  can  be  approximated  by  the  Gaussian  integral 


~  A(kg)e 


,iS(ks 


ei  skK.(ks)(k~ks)2  dk 


Ad-  \JS(ks) 

M,)  yfe M  ’ 


(36) 
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where  ±  =  sgn(Skk(ks ))•  Therefore,  for  t  1, 


±Mf) 


ei{ksx-u>{ks)t )  'v/^’e  4 

’ 


(37) 


since  £**  =  — ul,(k)t .  We  must  sum  over  the  contributions  from  each  of  the  wavenumbers 
giving  group  velocities  cg(ks)  =  x/t.  For  the  given  dispersion  relation  (8),  there  is  precisely 
one  such  wavenumber.  This  method  is  called  the  method  of  stationary  phase. 

There  is  a  singularity  in  (37)  if  un(ks)  =  0,  i.e.  at  the  maximum  attainable  group 
velocity.  From  the  form  of  the  dispersion  relation  (8)  it  may  be  determined  that  this  occurs 
iff  k  =  0  (the  ‘if’  statement  is  an  obvious  corollary  of  cg  being  an  even  function  of  /c,  but 
the  converse  statement  requires  a  little  more  work).  Much  of  the  energy  introduced  into  the 
medium  at  the  source  piles  up  in  the  fastest-traveling  part  of  the  wave  packet.  It  follows 
that  the  formula  is  not  valid  in  the  neighborhood  of  values  x,  t  where  x/t  =  c5(0)  =  y/gh. 
To  correct  it,  it  is  necessary  to  extend  the  Taylor  series  for  x/t  close  to  y/gh: 


/oo 

d  ki 

-OO 

/OO 

. 

-oo 


i(k-ks)3Skkk(ks)/6 


0d=icr3/6 


(  16tt3  \ 1/3 


IWfcs)|V3  \\Skkk(ks)\ 


Aksx—iu>t 


Ai{ 0)  (38) 


Since  Skkk{ks)  is  proportional  to  t  at  ks  —  0,  77  decays  as  t-1/3.  This  decay  is  significantly 
weaker  than  the  *-1/2  decay  at  points  behind  the  front.  In  order  to  obtain  a  uniform 
approximation  which  combines  (37)  for  x/t  /  y/gh,  and  (38)  for  x/t  ^  y/gh:  we  can  retain 
both  the  cubic  and  quadratic  terms  in  the  expansion  of  S(k).  The  resulting  integral  can  be 
written  as  an  Airy  function  for  77.  This  integral  was  first  used  by  George  Airy  in  his  analysis 
of  light-scattering  by  spherical  raindrops,  the  first  quantitative  model  for  the  distribution 
of  colors  in  a  rainbow. 

This  transitional  expression  is  valid  near  the  front  x/t  =  y/gfi  but  it  becomes  inaccurate 
far  from  the  front.  A  result  valid  over  the  whole  range  of  x  and  t  can  be  obtained  by  writing 
S(k)  as  a  cubic  polynomial  in  a  new  variable  in  a  suitable  way.  This  method,  developed  by 
Chester,  Friedman  and  Ursell,  yields  an  Airy  function  of  a  more  complicated  argument.  It 
was  applied  recently  to  tsunami  waves  by  Berry. 


Notes  by  Marcus  Roper  and  Ay  a  Tanabe. 
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Lecture  3:  Asymptotic  Methods  for  the  Reduced  Wave 

Equation 

Joseph  B.  Keller 


1  The  Reduced  Wave  Equation 


Let.  v  (t,  x)  satisfy  the  wave  equation 


Av  — 


1  d^v 
c2  (X)  dT 


=  0, 


(1) 


where  c(X)  is  the  propagation  speed,  at  the  point  X. 
<?(f)u(X).  Then 


c2(X) 


Av.  (X) 


g"(h 

g(t) 


Separate  variables,  letting  v  ( t ,  X)  = 


(2) 


So 


g"  (t)  +  u;2#  (/)  -  0 


(3) 


and 


A  u  + 


(X) 


u  =  0. 


(4) 


Here  the  const  ant  ce  is  the  angular  frequency.  Equation  4  is  known  as  the  reduced  wave 
equation  or  the  Helmholtz  equation.  Introduce  a  constant  reference  speed  Co,  and  define 
the  index  of  refraction  n(X)  =  co/c(X)  and  the  propagation  constant  (or  wave  number) 
k  —  u/c Q.  Then  the  reduced  wave  equation  (4)  becomes 


Au  +  (X)  u  =  0. 


(5) 


2  Leading  order  asymptotics 

When  n(X)  is  constant,  the  reduced  wave  equation  has  the  plane  wave  solution 

u  (X.  K)  =  z  (K)  einK  X .  (6) 

Here  the  propagation  vector  K  is  any  constant  vector  such  that  |K|  =  A:,  and  the  amplitude 
z  (K)  is  a  constant.  In  the  case  of  ??(X)  not  constant,  the  plane  wave  solution  motivates 
looking  for  solutions  to  (5)  of  the  form 

tt(X)  =  z{X,k)eik*w.  (7) 


14 


Here  z(X,  k )  is  the  amplitude  and  s(X)  the  phase.  Substituting  this  form  into  the  reduced 
wave  equation  (5)  yields 

- k 2  [(Vs)2  -  n2  z  +  2 ikS/s  ■  Vz  +  ikzAs  +  Az  =  0.  (8) 

We  are  interested  in  looking  at  the  asymptotic  behavior  of  solutions  to  the  reduced  wave 
equation  (5)  as  k  — ►  oo.  To  explore  this,  we  suppose  2  (X,  k)  has  an  asymptotic  expansion 
of  the  form 

00  00 

*  (X,  fc)  ~  (X)  (ik)-m  =  Y,  zm  (X)  (ik)-m  ,  zm  =  0  for  m  =  -1,  -2, ... .  (9) 

m= 0  m=— oo 

Here  ~  denotes  asymptotic  equality.  The  asymptotic  expansion  above  means  that  for  each 
n  >  0 

n 

Z(X,  k)=Y,Zm  (X)  (ik)~m  +  O  (k-n)  ,  (10) 

rn—0 

where  the  notation  o(k~n)  denotes  a  term  for  which  lim^oo  kn\o  (k~n)  |  =  0.  Note  that 
an  asymptotic  expansion  may  not  converge!  However,  by  truncation  of  the  series  we  get  an 
approximation  with  an  error  which  tends  to  zero  as  k  — »  0.  Substituting  the  asymptotic 
expansion  (9)  for  z(X,  k)  into  (8)  yields 

2  ( ikf-m  {  [(Vs)2  -  n2]  zm+ 1  +  [2  Vs  •  Vzm  +  zmAs)  +  Azm^  }  ~  0.  (11) 

m 

The  coefficient  of  each  power  of  k  must  be  zero.  For  m  =  —  1  this  gives 

(Vs)2  -  n2j  20  =  0,  (12) 

since  zm  =  0  for  m  —  —  1,—  2, ...  .  Assuming  zo  7^  0,  this  implies  the  eikonal  equation  for 
the  phase  5, 

(Vs)2  =  n2  (X) .  (13) 

m  =  0  yields  the  transport,  equation  for  the  leading  order  amplitude  zo , 

2Vs  *  V^o  +  =  0.  (14) 

m  =  1,2,...  yield  further  transport  equations  for  determining  the  other  zm.  We  shall 
concentrate  on  the  leading  order  amplitude  zq  in  what  follows.  The  leading  order  solution 
zo  (X)  el/cs(x)  is  known  as  the  geometrical  optics  field, . 

3  Phase,  Wavefronts,  and  Rays 

Surfaces  of  constant  phase,  defined  by  $(X)  =  constant,  are  called  wavefronts.  Curves 
orthogonal  to  the  wavefronts  are  called  rays  (or  more  generally,  characteristics) ,  and  are 
used  to  solve  for  s(X).  We  write  the  equation  of  a  ray  in  terms  of  a  parameter  a  in  the 
form 

X  =  (xl,X2,xs)  =  X(a)  (15) 
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Orthogonality  of  the  ray  and  the  wavefronts  implies 


djcj_  =  ds_ 
da  dxj  ’ 


(16) 


where  A(X)  is  an  arbitrary  proportionality  factor,  and  j  —  1,2,3.  Now,  dividing  the  above 
expression  by  A,  differentiating  with  respect  to  a,  and  using  the  summation  convention  we 
have  that 


d  / 1  d,xj\  _  d  (  ds  \  _  dxi  d2s  _  ds  d2s  _  A  d  /  ds  ds  \ 

do  \X  do  )  do  \dxj  )  d,o  dxidxj  dx^  dx^dxj  2  dxj  dx,{ ) 


Then,  using  the  eikonal  equation  (13)  on  the  right  hand  side  we  have  that 


1  d,  /  1  dxj  \  d  ( n2  \ 

A  do  \  A  do  )  dxj  \  2  ) 


(18) 


Furthermore,  substituting  the  orthogonality  equation  (16)  into  the  eikonal  equation  (13) 
yields 


dxj  dxj 
d,o  d  o 


2_2 


An 


(19) 


The  four  equations  given  by  (18)  and  (19)  are  known  as  the  my  equations .  The  three 
equations  given  by  (18)  arc  second  order  ordinary  differential  equations  for  the  rays  X(cr), 
and  (19)  gives  the  variation  of  o  along  the  ray.  The  rays  are  determined  solely  by  n(X) 
once  the  initial  values  for  (18)  are  specified  and  the  arbitrary  proportionality  factor  A(X) 
chosen . 

Since  A  is  arbitrary,  we  may  choose  it  as  we  please.  When  A  “  n~3  the  ray  equations 
become 


dxj 

do  V  do 


dxj  dxj 


do  do 


<9 

~  dxi 
=  i. 


(20) 

(21) 


(21)  implies  that  o  is  simply  the  arc  length  along  the  ray.  When  A  =  1  with  a  replaced  by 
r,  the  ray  equations  become 

d2Xj  _  d  ( n2\ 

d,r 2  dxj  \  2  / 

dxj  dxj  o 
=U  ' 


(22)  has  a  natural  interpretation  in  terms  of  classical  mechanics,  with  the  left  hand  side 
being  an  acceleration  and  the  right  hand  side  being  the  gradient  of  a  potential.  Also,  from 
(21)  and  (23)  we  can  see  o  is  related  to  r  by 


do  —  yj  dxjdxj  =  nd,r. 


(24) 


cqt  is  known  as  the  optical  length  along  a  ray. 
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4  Ray  solution 


The  eikonal  equation  (13)  can  be  solved  for  the  phase  s.  Using  the  orthogonality  (16)  we 
have  for  the  derivative  of  s  along  a  ray 


b  'XM>= 


ds  dxj 
dxj  da 


ds  ds 
dxj  dxj 


A  n2 


This  can  be  integrated  to  give  the  solution  for  s 


(25) 


s[X  (a)]  =  s[X(<t0)]  +  ^  A  [X  {o')]  n2  [X  (a')]  do'. 
Joo 


(26) 


The  transport  equation  (14)  can  be  solved  for  the  leading  order  amplitude  zq.  Again 
using  orthogonality,  we  find  that 


Vs  •  V^o 


ds  dzo  1  dxj  dzo 
dxj  dxj  A  da  dxj 


(27) 


Thus  the  transport  equation  (14)  becomes  a  first  order  ordinary  differential  equation  along 
the  ray 

T~p-  +  zo^s-0.  (28) 

A  da 

Given  initial  conditions  (28)  can  also  be  integrated  to  solve  for  zo .  However,  there  is  a  more 
direct  way  to  solve  for  zq.  Note  that  (14)  implies 


V  •  (zqVs)  =  zq  (2Vzo  *  Vs  +  zqAs)  =  0, 


(29) 


Introduce  a  region  R  bounded  by  a  tube  of  rays  containing  the  given  ray,  and  by  two 
wavefronts  W(ao)  and  W(a)  at  the  points  ao  and  a  of  the  given  ray  (Figure  1).  Then 
the  gradient  of  the  phase,  Vs,  is  parallel  to  the  sides  of  the  tube  and  normal  to  its  ends. 
Integrating  (29)  over  R  and  using  the  divergence  theorem  yields 

0  =  /  v  •  (^0  Vs)  dV  =  f  zfos  ■  Nda  -  f  z$Vs  ■  N da.  (30) 

Jr  J\V(cr)  Jw(ao) 

Here  N  is  a  unit  vector  orthogonal  to  the  wavefront  and  da  is  an  element  of  area  on  the 
wavefront.  From  the  eikonal  equation  (13)  we  have  that  Vs  •  N  =  n.  Then,  by  shrinking 
the  tube  of  rays  to  the  given  ray  we  obtain  the  solution  for  zo  from  (30) 

Zq  (o)  n  (o)  da  (o)  =  z2  (<r0)  n  (o0)  da  (cr0) .  (31) 


This  can  be  written  more  conveniently  in  terms  of  the  expansion  ratio  £  (cr)  with  respect 
to  a  reference  point  a\  on  the  ray,  defined  by 


da  (a) 
da  (a i ) 


(32) 


17 


Figure  1:  Ray  tube  and  wavefronts  defining  region  R . 


The  expansion  ratio  measures  the  expansion  of  the  cross-section  of  a  tube  of  rays,  and  is 
simply  the  Jacobian  of  the  mapping  by  rays  of  W  (a i)  on  W  (a).  (31)  then  becomes 


Z[)  (a)  =  Zq  (<7q) 


{vo)n(ao) 
.  £  {?)  n  {a) 


(33) 


Importantly  we  note  that  the  amplitude  zq  (a)  varies  inversely  as  the  square  root  of  n£ 
along  a  ray.  Thus  for  n  constant,  as  rays  converge  the  amplitude  zq  increases,  and  as  rays 
diverge  zq  decreases. 


5  Case  of  Homogeneous  Media 


A  homogeneous  medium  is  defined  as  one  where  the  propagation  speed  c(X),  and  thus 
n(X)  =  cq/c(X),  are  constants.  If  A  =  n~\  the  ray  equations  (22)  become 


d2Xj  d  ( n 2  \ 

dr 2  dxj  \  2  / 


=  0. 


(34) 


(34)  gives  that  the  rays  are  straight  lines.  The  equation  (26)  for  the  phase  s  becomes 


s(cr)  =  s(cr0)  +  n(<7  -  CT0). 


(35) 


To  determine  the  amplitude  zq(g)  using  (33),  we  need  to  determine  the  expansion  ratio 
£((j).  To  calculate  the  expansion  ratio,  look  at  two  intersecting  rays  which  form  an  infinites¬ 
imal  angle  d.6\,  as  in  Figure  2.  Now  take  any  two  wavefronts  W (0)  and  W(a)  intersecting 
these  two  rays.  Denote  the  distance  between  them  as  a,  and  the  distance  to  the  intersection 
point  to  be  pi,  the  radius  of  curvature  of  FF(0).  Then  we  can  calculate  the  infinitesimal 
area  ratio  to  be 


d.a(a)  _  (pi  +  o)d,6\  _  p\  +  a 
da{ 0)  p\dB\  pi 


(36) 
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pi  +  a  is  the  radius  of  curvature  of  W(a).  (36)  states  that  the  expansion  ratio  is  just  the 
ratio  of  the  radii  of  curvature  of  the  two  wavefronts.  (33)  then  becomes 


z0  (a)  =  z0  (cr0) 


Pi  +00 


1/2 


Pi  +  o 


(37) 


To  use  this  analysis  works  in  three  dimensions,  we  must  take  twp  cross  sections  which 
slice  the  wavefront  into  its  curves  of  maximal  and  minimal  curvature.  In  3-D  this  analysis 
gives  the  expansion  ratio 

_  (pi  +  0p(P2  +  o) 

PlP2 

The  3-D  analogue  of  (37)  is 


z0{o)  =  z0(a0) 


(Pi  +0o)(p2  +  0~o) 
(pi  +ct)(p2  +  a) 


(39) 


6  An  Initial  Value  Problem  for  the  Eikonal  Equation 

Here  we  consider  the  solution  of  the  eikonal  equation  with  initial  data  s(x)  given  at  a:  on  a 
manifold  M,  ie  a  point,  line  or  surface.  The  eikonal  equation  is 

(Vs)2  =  n2.  (40) 

To  make  s(x)  unique,  we  impose  the  condition  that  the  solution  is  outgoing.  Mathematically, 
this  condition  can  be  expressed  as 

Vs  •  N  >  0,  with  N  =  The  unit  outward  normal  from  M.  (41) 

Here  M  is  the  initial  surface  from  which  the  solution  is  outgoing. 


We  will  solve  this  problem  using  the  method  of  characteristics.  When  the  initial  data 
is  given  at  a  point  p,  we  can  define  the  solution,  using  the  previous  theory,  on  each  ray 
emanating  from  p. 
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When  the  initial  data  is  given  on  a  curve  C  we  must  determine  the  angle  at  which  rays 
emanate.  This  can  be  done  by  parameterizing  the  curve  by  arc  length,  77.  Now  the  initial 
condition  s\c  =  so (77)  with  a  parameterization  Xo(t7)  of  the  curve  C,  yields  the  equation 
s0 (??)  =  s(Xq(77)).  Differentiation  yields 


Vs- 


rfXo 

drj 


dsp 

drj 


(42) 


Now  we  use  the  vector  identity  a  •  b  =  |a||b|  cos/3,  where  a  and  b  are  vectors  and  /?  is  the 
angle  between  them.  Then  (42)  gives  that  the  angle  73(77)  between  the  tangent  vector 
to  the  curve  C  and  the  direction  Vs  of  the  ray  is  given  by 


cos  (/?(??))  = 


1  ds 


0 


1 


dso 


(43) 


I  Vs  |  drj  n  [X0(t7)]  drj  ' 

We  have  now  shown  how  to  solve  the  initial  value  problem  with  initial  data  at  a  point 
p  or  on  a  curve  C  in  an  infinite  domain.  Similar  analysis  works  when  initial  data  is  given 
on  a  surface.  Next  we  will  consider  what  happens  when  the  domain  has  boundaries. 


7  Reflection  From  a  Boundary 


In  order  t.o  consider  reflection  from  a  boundary  33,  we  must  first  prescribe  a  boundary 
condition.  We  will  take  the  general  impedance  boundary  condition,  with  impedance  Z: 

r\ 

+  ikZ(X)u  -  0,  X  on  J3,  v  =  V5.  (44) 

ov 

Notice  that  the  limits  Z  — >  0  and  Z  — >  oc  yield  the  simpler  Neumann  and  Dirichlet  boundary 
conditions.  To  satisfy  the  boundary  condition  (44),  we  must  introduce  a  reflected  wave,  ur 
in  addition  to  the  incident  wave,  ul.  Her c  we  will  try  the  same  type  of  expansion  for  ur 
that  we  have  been  using  for  ul 

oc 

(45) 

m=0 

Now  plugging  u  =  ur  +  u7  into  the  boundary  condition,  we  see  immediately  that  the  phases 
must  match  on  the  boundary  for  these  waves  to  add  to  zero 

5r(X)  =  s7 (X) ,  X  on  B.  (46) 


Now  collecting  powers  of  (ik)  we  get  the  following  equation  for  the  leading  order  amplitudes 


-0 


+ 


^0 


=  0. 


(47) 


Thus 


(48) 


We  have  found  the  phase  and  amplitude  of  the  reflected  wave  on  the  boundary.  Then  we 
can  use  the  previous  method  to  construct  the  reflected  wave. 
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8  Reflection  From  a  Parabolic  Cylinder 


To  illustrate  how  this  method  works  we  will  consider  the  example  of  waves  reflected  by  a 
parabolic  cylinder.  Physically  we  could  envision  this  to  be  the  example  of  waves  hitting  a 
vertical  cliff  with  a  parabolic  profile  when  viewed  from  above.  Here  we  will  consider  the 


Figure  3:  Reflection  from  a  parabolic  cylinder. 

simple  problem  of  an  incoming  plane  wave  eikx ,  with  the  x  axis  the  axis  of  symmetry  of 
the  parabola.  We  will  also  take  a  uniform  bottom,  so  that  n  =  1,  and  the  cliff  face  to  be 
rigid,  =  0  for  x  6  B.  Now  using  the  fact  that  parabolas  focus  all  rays  to  a  point,  we  see 
that  the  rays  reflected  from  the  boundary  of  the  cliff  will  all  emanate  from  the  focus  of  the 
parabola.  Thus  the  wavefronts  will  be  circles  centered  at  the  focus  of  the  parabola  (Figure 
3).  If  we  define  our  coordinate  system  with  origin  at  the  focus  of  the  parabola,  we  get  the 
phase  of  the  reflected  wave  to  be 

s(r)  =  so  +  r.  (49) 

Here  r  is  the  distance  from  the  origin,  as  in  polar  coordinates.  We  can  also  determine  the 
reflected  amplitude.  Here  matching  the  incident  and  reflected  amplitude  on  the  boundary 
and  using  equation  (37)  give 

zo{r)  =  (50) 

Here  r  =  ro(6)  is  the  equation  of  the  parabola.  Thus  we  get  the  leading  order  solution  to 
be 

u  =  U*  +  ur  ~  eikx  +  ^^eik{so+r).  (51) 

Similar  analysis  enables  us  to  determine  the  higher  order  terms  in  ur . 

Notes  by  Ben  Akers  and  John  Rudge. 
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Lecture  4:  Geometrical  Theory  of  Diffraction  (continued)  and 

the  Shallow  Water  Theory 

Joseph  B.  Keller 


1  Introduction 

In  this  lecture  we  will  finish  discussing  the  reflection  from  a  boundary  (Section  2).  Next,  in 
Section  3  we  will  switch  to  the  geometric  theory  of  diffraction  without  formal  details  (see 
[3]).  Further,  in  Section  4,  we  will  generalize  the  surface  water  wave  theory  to  a  case  of 
non-uniform  depth  (see  [2]).  This  theory  predicts  infinite  amplitude  at  the  shoreline,  hence, 
in  Section  5,  the  shallow  water  theory  is  introduced  to  fix  that  problem. 


2  Reflection  from  a  boundary 

In  the  previous  lectures  we  developed  a  method  for  solving  the  Helmholtz  equation 

Af/  +  k2n2(X)U  -  0.  (1) 


It  yields  an  asymptotic  approximation  like  geometrical  optics.  It  was  applied  to  reflection 
by  a  parabolic  cylinder.  Now  let  us  analyze  reflection  of  waves,  in  water  of  constant  depth 
h  =  const,  by  an  arbitrary  smooth  boundary  B ,  e.g.  a  vertical  river  bank.  In  this  case  the 
velocity  potential  is  given  by  an  incident  contribution  <pl  and  a  reflected  part  (pr .  Since  the 
normal  velocity  on  the  boundary  vanishes,  dy(t)\B  =  0.  The  zero-order  asymptotic  solution 
is  given  by 

4>  =  <P  +  <f +  zl{x)eiksr.  (2) 

The  condition  of  vanishing  normal  derivative  yields 
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ikS1 


d_ 

dn 


(x)  ~  0.  (3) 


Upon  equating  the  exponents  in  (3),  we  get 


=  Sr  on  B. 


(4) 


Then  upon  equating  to  zero  the  coefficient  of  k  in  (3),  we  get 
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-S'  +  zk  —  S'=  0. 


H)d7i  T~°d« 


(5) 
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^Prorn  (4),  the  tangential  derivative  equals  Then  from  the  eiconal  equation,  the 
normal  derivatives  are  related  by 


dn  ) 


dr  J 


(  dSr  \  2  _ 


2  /  c\  rti  \  2 

\dn) 


V. 


(6) 


Thus  dSr/dn  =  ±dSl/dn.  The  “+”  sign  would  yield  Sr(x )  =  Sl(x),  so  we  must  choose  the 


sign 


Then  (5)  yields 


dn 


gg 

On 


zg  =  011  -B- 


(7) 


(8) 


From  dSr  /dr  =  dSl/(h  and  (7)  we  get  law  of  reflection,  familiar  from  geometrical  optics. 


Instead  of  a  rigid  boundary,  we  can  also  consider  the  impedance  boundary  condition 

r\ 

■%-<!> +  ikZ<j>  =  0.  (9) 

on 

This  condition  is  frequently  used  in  electrodynamics,  and  in  acoustics  for  compliant  bound¬ 
aries.  In  this  case  it  follows  that 


sr(X)  -  X  on  B, 


Ji 

“m 


dsl 

dv 


zo  ( 

(%7  +  Z)  +  zm  ( 


+  z)+zl{^  +  Z)  =  Q, 


d£  +  z)  +  djkr l  + 


dzl 


X  on  B, 

=  0,  m  >  1,  X  on  B. 


(10) 

(11) 

(12) 


In  terms  of  the  incidence  angle  a  and  the  impedance  Z,  (11)  gives  the  reflection  coefficient 


Zq  n  cos  a  —  Z 
Zq  n  cos  a  +  Z  ’ 


(13) 


again  for  I  on  B. 

Fermat’s  principle  of  geometrical  optics  follows  from  the  eiconal  equation.  It  states 
that  a  ray  travelling  between  two  points  takes  the  path  with  the  shortest  optical  distance. 
For  media  with  n(x)  =  const,  we  can  find  this  path  by  imagining  a  string  tightly  spanned 
between  these  two  points.  Then  the  ray  path  will  lie  along  this  string. 

To  calculate  the  wave  field  numerically  one  often  uses  the  finite  element  method.  Due 
to  the  oscillatory  nature  of  the  wave  field,  this  procedure  requires  very  small  elements.  But 
we  know  that  the  leading  order  approximation  has  the  form  ZQ(x)elkS1  +  ZQ(x)eikS1 .  We  can 
use  this  ansatz,  calculating  the  phase  by  means  of  rays.  After  that  the  amplitudes  Zq(x) 
and  Zq(x)  can  be  calculated  by  the  finite  element  method.  This  allows  us  to  use  much  larger 
elements. 


23 


Figure  1:  Scattering  by  a  smooth  object.  Regions  for  asymptotic  expansion:  A  -  region  of 
ordinary  geometrical  optics,  B  -  point  at  which  incident  rays  are  tangent  to  the  boundary, 
S  -  the  object  boundary,  C  -  shadow  boundary,  D  -  shadow  region. 

3  Geometrical  theory  of  diffraction 

The  asymptotic  expansion  method  presented  in  the  previous  lectures  is  incomplete  because 
of  phenomena  which  are  usually  not  taken  into  account  by  ordinary  geometrical  optics. 
Let  us  for  example  consider  water  waves  being  scattered  by  an  island  which  we  assume 
to  be  an  oval-shaped  object  with  smooth  boundary.  From  ordinary  geometrical  optics 
follows  that  there  exists  a  shado  w  region  D  (see  Figure  1)  in  which  the  intensity  of  waves 
is  zero.  This  region  is  separated  from  the  region  A ,  reached  by  incident  and  reflected 
rays,  by  a  surface  called  the  shadow  boundary .  Obviously,  along  this  boundary  the  solution 
obtained  by  the  ordinary  geometrical  optics  method  is  discontinuous.  However,  this  is  in 
sharp  contradiction  with  the  fact  that  solutions  of  the  Helmholtz  equation  (1)  are  smooth 
a, way  from  the  boundary.  Agreement  of  asymptotic  solutions  with  actual  solutions  can  be 
achieved  by  introducing  boundary  layer  solution  in  the  neighborhood  of  shadow  boundaries. 
They  can  be  found  by  using  asymptotic  expansions  of  certain  exactly  solvable  problems, 
or  constructed  by  boundary  layer  techniques.  The  construction  of  asymptotic  solutions 
requires  different  expansions  in  different  regions. 

We  first  will  consider  asymptotic  solutions  in  the  shadow  region  D .  The  only  rays  which 
reach  this  region  are  rays  diffracted  by  the  boundary  S.  They  are  called  surface  diffracted 
rays.  To  construct  them  we  introduce  surface  rays  (or  creeping  rays )  which  propagate 
along  the  boundary  5.  The  point  B  (see  Figure  2)  on  the  boundary  between  the  shadow 
region  D  and  the  illuminated  region  A  acts  as  a  source  for  these  rays.  Note  that  at  this 
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Figure  2:  Surface  diffracted  rays  omitted  by  the  propagating  surface  ray  starting  at  the 
point  B. 


point  the  incident  ray  is  tangent  to  the  boundary.  Therefore,  the  incident  ray  splits  into 
two  branches.  One  branch  goes  along  the  shadow  boundary  C  while  the  other  is  the  ray 
travelling  along  the  boundary  S.  The  latter  radiates  surface  diffracted  rays  into  the  shadow 
region,  and  therefore  the  boundary  S  acts  as  a  secondary  source.  Because  of  this  radiation, 
the  intensity  on  the  surface  decays  exponentially  with  distance  along  the  ray.  The  surface 
ray  travels  infinitely  many  times  around  the  boundary.  Thus  it  sends  an  infinite  number  of 
surface  rays  to  each  point  in  the  shadow,  and  also  to  each  point  in  the  illuminated  region. 
Thus  the  complete  wave  field  in  the  shadow  region  is  an  infinite  sum  of  diffracted  fields  on 
surface  diffracted  rays. 

The  wave  field  on  and  near  the  shadow  boundary  can  be  obtained  by  using  boundary 
layer  theory.  It  yields  Fresnel  integrals  which  were  used  in  the  method  of  stationary  phase. 
In  the  neighborhood  of  the  separation  point  B  there  is  yet  another  kind  of  asymptotic 
solution,  given  by  the  Fock  function. 

The  asymptotics  of  the  wave  field  in  the  neighborhood  of  the  boundary  S  can  be  obtained 
from  the  exact  solution  of  the  Helmholtz  equation  for  diffraction  by  a  circular  cylinder  of 
radius  a.  In  cylindrical  coordinates  a  mode  of  the  two  dimensional  wave  field  can  be  written 
as 

u  =  eiv6H^\kr).  (14) 

Here  H^\  the  Hankel  function  of  first  kind  [1],  is  outgoing.  Suppose  the  boundary  condition 
is  u(r)\r~a  =  0.  This  leads  to  the  equation 

tfW(fca)  =  0.  (15) 


25 


This  equation  (15)  lias  infinitely  many  complex  roots  vrn .  The  asymptotic  behaviour  of  vm 
for  ka  1  is  given  by 

ym  ~  ka  +  (16) 

When  (16)  is  used  for  u  in  (14),  u  becomes 


=  e 


'eH\}\kr)  -  ei^>+i^/3rm^o)^30 H(l) 


(17) 


The  result  (17)  shows  that  a  single  mode  um(r,6)  decays  exponentially  with  6  at  a  rate 
proportional  to  (A:a)ly/3.  For  a  noncircular  boundary  ,  the  local  decay  rate  can  be  obtained 
by  replacing  a  by  the  local  radius  of  curvature  a(s)  and  setting  dO  —  a”1  {s)ds.  Then  the 
exponent  for  the  ??-th  mode  becomes 


ihs  +  ie”r/3rnjA:1/3  /  (a(a/))  2^3ds.  (18) 

Jo 

The  amplitude  of  each  mode  also  involves  diffraction  coefficients  at  the  point  B ,  where  the 
surface  ray  begins,  and  at  the  point  B*  where  it  leaves  the  surface.  Then  the  total  field  in 
the  shadow  is  a  sum  of  all  the  modes. 

When  a  wave  is  diffracted  by  an  axially  symmetric  object  in  three  dimensions,  the 
diffracted  waves  have  a  caustic  along  the  axis.  This  yields  a  bright  spot  in  the  cross-section 
of  the  shadow.  The  wave  field  on  and  near  this  caustic  can  be  expressed  in  terms  of  Bessel 
functions. 

Now  let  us  consider  diffraction  of  a  normally  incident  plane  wave  by  a  planar  screen 
of  arbitrary  shape,  with  a  smooth  boundary  S .  Instead  of  a  bright  spot,  there  is  a  bright 
curved  line  in  each  normal  cross-section  of  the  shadow.  The  form  of  the  bright  line  is 
given  by  the  evolute  of  the  curve  5,  and  it  is  a  caustic  of  the  edge  diffracted  rays.  For  a 
planar  curve,  the  evolute  is  the  locus  of  centers  of  curvature  of  the  normals  to  the  curve. 
For  example,  for  an  ellipse  the  bright  line  is  given  by  the  Lame  curve,  see  Figure  3.  To 
summarize:  an  asymptotic  solution  will  usually  consist  of  a  sum  of  waves.  Every  wave 
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is  constructed  by  means  of  rays.  They  are  obtained  by  solving  the  ray  equations.  Then 
the  phase  S  and  the  amplitude  zo  are  found  along  each  ray  by  the  formulae  given  above. 
The  other  zm  can  be  found  as  solutions  of  the  appropriate  transport  equations.  This  same 
approach  is  used  in  the  next  sections. 

4  Surface  waves  on  water  of  nonuniform  depth 

Previously  we  suggested  that  in  water  of  nonuniform  depth,  we  could  determine  the  wave 
motion  by  using  the  reduced  wave  equation.  That  suggestion  yields  the  correct  phase,  but 
not  the  correct  amplitude.  Therefore  we  shall  now  present  an  analysis  which  determines 
correctly  both  the  phase  and  the  amplitude1. 

As  before,  we  assume  that  the  water  is  inviscid,  incompressible  and  in  irrotational 
motion.  It  is  bounded  above  by  an  unknown  free  surface  Z  =  y)]  {rj  is  the 

complex  amplitude  of  the  surface  wave  motion  of  angular  frequency  cu)  and  bounded  below 
by  a  rigid,  non-uniform  surface  Z  =  —  H(x,y).  The  exact  linear  theory  of  surface  waves 
yields  for  the  free  surface  height 

/  \  icu  f  . 

7]{x,y)  =  — 0), 

a 

where  Z)  is  the  velocity  potential  and  g  is  the  gravitational  acceleration2. 

The  velocity  potential  satisfies  (see  Stoker  [5]) 

-  0  in  0  ;>  Z  ^  -H{x,  y),  (19) 

3>z  =  /?< I>  on  Z  =  0  (p  =  u2/g),  (20) 

9z  +  HxQx  +  Hy*y  =  0  on  Z  = -H(x,y).  (21) 

In  the  constant-depth  case,  the  solution  decays  exponentially  with  depth,  so  short  waves 
do  not  “feel”  the  bottom.  To  keep  the  influence  of  the  depth  variability,  we  rescale  the 
vertical  axis  by  introducing 

z  =  f3Z,  h  =  /3H,  <j>(x,y,z)  =  $(x,y,Z).  (22) 

Then  the  problem  (19)-(21)  becomes  that  of  finding  solutions  (f)  of  the  set  of  rescaled 
equations 
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(j>z  =  (p  on 

z  —  0, 

(24) 

to 

-e- 

+ 

-O- 

H 

+ 

5- 

II 

o 

on  2:  —  —  h(x,  y ). 

(25) 

JIt  fails  at  the  shoreline,  where  there  is  a  boundary  layer,  see  Section  5. 

2 The  reason  for  using  upper- case  letters  is  to  save  the  lower-case  ones  for  the  rescaled  variables. 
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We  seek  solutions  of  (23)- (25)  for  large  values  of  (3 . 

Motivated  by  the  constant-depth  solution,  we  express  (p  in  the  form 

(p  =  A  cosh  [k(z  +  h)}el(3S.  (26) 

Here  k  =  /c(.r,y),  S  =  S(x,y )  and  A(x,  y,  z,  (3)  are  functions  to  be  determined. 

Plugging  (26)  into  the  system  of  equations  (23)-(25)  leads  to 

/32  ((A:2  -  (VS)2)A  cosh  a  +  cosh  a  +  2k  Az  sinh  a)  + 

+i/ 3  ((V25)A  cosh  a  +  2V5  •  V(A  cosh  a))  +  V2(A  cosh  a)  =  0,  (27) 

Az  cosh  kh  +  kA  sinh  kh  =  A  cosh  kh  z  =  0,  (28) 

(32Az  +  ifiAVh  •  VS  +  Vh  •  VA  =  0  *  =  -/?,  (29) 

Here  a  =  A:(^  +  /?,)  and  V  =  (<9/<9.x,  d/dy). 

Next  we  assume  that  A  admits  the  following  asymptotic  expansion  for  large  (3 : 

oc 

A(x,y,  z,($)  ~  A0{x,y)  +  '^2An{x,y,z)/(i(3)n.  (30) 

n=  1 

Again,  motivated  by  the  constant-depth  case,  we  have  assumed  that  the  first  term  Ao  does 
not  depend  on  the  vertical  coordinate.  We  also  assume  that  termwise  differentiation  in  (30) 
is  allowed. 

Inserting  the  asymptotic  expansion  (30)  into  (27)-(29),  and  equating  coefficients  of  the 
corresponding  powers  of  /?,  we  obtain  the  following  three  systems  of  equations: 

'  (VS)2  =  A:2 

(A\)zz  cosh  a  +  2k(Aj)z  sinh  a  =  2VS  •  V(Aocosha)  +  Ao  cosh  aV2S 
<  ~  ~  (31) 

{An)zz  cosh  a  +  2k(An)z  sinh  a  =  2 VS  •  S7(An- j  cosh  a)  +  An-\  cosh  oV2S'+ 

„  +V2{An-2  cosh  a)  (n  ^  2), 

k  tanh  kh  =  1 

<  (32) 

k  {An)z  =  0  at  2  =  0  (n  ^  1), 

and,  finally, 

(  (4,).  =  AoVh-VS  at,  z  = -h 

{  (33) 

(  {An)z  =  An-]  V/)  •  VS  +  V/i  •  V;4n_2  at  z  = —h  (n  ^  2). 
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The  first  equation  in  (32)  determines  k(x,y)  as  a  function  of  the  known  depth  h(x,y). 
Then  the  eiconal  equation  (VS)2  =  k 2  can  be  solved  for  S(x,  y)  by  the  ray  method  discussed 
in  the  previous  lectures. 


In  order  to  find  the  amplitude  A^  we  use  the  identity 

{{An)z  cosh2  a). 


cosh  a 


(34) 


( An)zz  cosh  a  +  2 k(An)z  sinh  a  = 

Inserting  (34)  into  (31.2)  we  obtain 

{{A{)z  cosh2  a) z  =  (2 VS  •  VA0  +  A0V2S)  cosh2  a  +  A0VS  •  V  cosh2  a.  (35) 
Now  we  integrate  (35)  from  0  to  z,  using  the  boundary  conditions  (32.2) 

{A\)z  cosh2  a  =  ^  (2V5  •  VAo  +  AqV2S  +  AoVS  •  V)  (k~1  [sinh  a  cosh  a  —  sinh  fc/icosh  kh]  +  y)  . 

(36) 

Solving  (36)  will  give  A\  up  to  an  additive  function  of  (x,y),  if  ^4o  is  known. 

Next  we  set  z  =  —h  in  (36)  and  eliminate  (Ai)Zl  using  the  boundary  condition  (33.1). 
This  leads  to  an  equation  for  Aq 

2 A0Vh  •  VS  -  -(2V5  •  V,40  +  A0V2S)(sinh2  A:/?,  +  /i)  +  ,40VS  •  (Vh  -  Vsinh2  fc/i).  (37) 
Equivalently,  (37)  can  be  written 

VS  •  V  (i4g(sinh2  kh  +  h))  +  (^(sinh2  kh  +  h))  V2S  -  0.  (38) 

We  note  that  VS  •  V  =  k(d/dr),  where  r  measures  arc-length  along  a  ray.  Then  the 
solution  of  (38)  can  be  written  in  the  form 


-4o(sinh2  kh  +  h)  =  [yl2(sinh2  kh  +  h)]To  exp  J  k  1  V2Sdr^  . 
In  [4],  Luneberg  has  shown  that  the  exponential  above  is  given  by 


exp 


/v 

Jr 0 


V2Sdr  = 


k(r0)  dfl(ro) 
k(r)  da(r) 


(39) 


(40) 


Here  da(r)  is  the  width  of  an  infinitesimally  narrow  strip  of  rays  at  r.  Plugging  (40)  into 
(39),  we  finally  get  the  following  equation  for  Aq  along  a  ray: 


^4o(sinh2  kh  +  h)k  da  =  const. 


(41) 


Equation  (41)  simply  expresses  the  fact  that  the  energy  flux  is  constant  along  a  tube  of  rays. 
By  using  (41)  for  Ao(x,y)  in  (26),  we  get  the  leading  term  in  the  asymptotic  expansion  of 
<f>  in  water  of  variable  depth. 


The  amplitude  Ao  is  infinite  at  the  shoreline,  where  h  =  0 .  This  means  that  there  is  a 
boundary  layer  at  the  shore.  To  analyze  the  solution  in  this  layer,  we  use  the  shallow  water 
theory,  which  is  introduced  in  the  next  section. 
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5  Shallow  water  theory 


The  main  reference  for  this  section  is  Stoker’s  famous  monograph  [5].  For  simplicity,  we 
consider  the  2-dimensional  case  with  horizontal  a>axis  and  vertical  z-axis.  The  free  surface 
is  given  by  2  =  rj(x,  £),  while  the  bottom  is  z  —  —h(x).  The  equation  of  continuity  for  the 
components  u(x,z,t)  and  v(x,z,t)  of  the  water  velocity  is 

Ux  "f"  vz  —  0.  (42) 

At  the  free  surface  we  have  both  the  kinematic  condition 


(Vt  +  uVx  -  v)  \z=1]  =  0 


(43) 


and  the  dynamic  condition 


The  bottom  boundary  condition  is 


P\z=ri  =  0. 


(44) 


Integrating  (42)  gives 


{uhx  +  v)  |-=_/(  =  0. 


rv 

I  UTdz  V 

J-h 


\V 

I— h 


=  0. 


Using  the  top  and  bottom  boundary  conditions  in  (46)  leads  to 


iixdz  ijt  -)-  'll 


\T] 


•  7]x  +  u  |_/t 


•  hx  =  0 . 


(45) 

(46) 


(47) 


We  can  rewrite  (47)  as 

d  rn 

aiJ.hudz  =  "H-  (48) 

Notice  that  up  to  this  point  no  approximation  has  been  introduced.  The  sole  approxi¬ 
mation  of  the  shallow  water  theory  is  to  ignore  the  vertical  acceleration.  This  is  assumed 
because  the  water  is  shallow.  Hence  the  pressure  is  given  as  in  hydrostatics,  namely 


p  =  gp{iT-z).  (49) 

Here  p  is  the  water  density  and  g  is  the  acceleration  of  gravity.  Differentiating  (49)  with 
respect  to  x  gives 

Px  —  gpvx-  (50) 

Note  that  since  i]x  is  independent  of  y .  so  is  px . 

Next,  we  assume  that  u  is  independent  of  z  at  t  =  0  (This  is  true  if  the  water  is 
initially  at  rest).  This  will  imply  that  u  is  independent  of  z  at  all  times,  since  its  horizontal 
acceleration  p~}px  does  not  depend  on  z  either,  as  (50)  shows.  Then  (48)  becomes 

[u(ri  +  h)]x  =  -i)t  ■  (51) 
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Multivalued  solution 


Figure  4:  The  “bore”  formation. 


The  Eulerian  equation  of  motion  for  u(x,  t )  is 


ut  +  uux  =  -gr]x  . 


(52) 


Eqs.  (51)  and  (52)  constitute  the  (nonlinear)  shallow  water  theory  for  determining  u  and 
V- 

When  u  and  r)  and  their  derivatives  are  small  enough,  we  can  linearize  (51)  and  (52). 
This  yields  the  linear  shallow  water  theory ,  in  which  u  and  rj  satisfy 


Ut  —  grjx , 

<  (53) 

k  (uh)x  =  - r)t . 


Eliminating  7]  from  (53)  yields 

{hu)xx  -  \(hu)tt  =  0.  (54) 

gh 

We  have  multiplied  and  divided  by  h(x)  to  get  the  linear  wave  equation  for  the  quantity 
( hu ).  The  propagation  speed  is  yfgh.  If  h  —  const,  (54)  is  a  linear  wave  equation  just  for  u. 
The  linear  shallow  water  theory  is  used  for  the  tides,  where  large  wavelengths  are  involved. 

The  equations  of  the  nonlinear  shallow  water  theory  admit  an  interesting  analogy  with 
the  differential  equations  of  gas  dynamics.  Let  us  define  p,  the  mass  per  unit  area,  by 

p  =  p(j]  +  h).  (55) 

/■From  (55) 

Pt  =  PVt ■  (56) 
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(57) 


Then  the  force  per  unit  width  p  —  J\pdz  is  given  by 

Now  we  multiply  (52)  by  p(j]  +  h)  to  get 


p(i1  +  h)(ut  +  uux)  =  -3/9(77  +  h)rjx  . 

(58) 

Then  using  (55)  and  (57),  we  can  write  (58)  as 

p(ut  +  uux)  =  -px  +  gphx  . 

(59) 

In  terms  of  p,  we  can  write  (51)  as 

{pu)x  =  -pt. 

(60) 

The  equations  (57),  (59)  and  (60)  are  exactly  the  equations  for  the  one  dimensional  flow  of 
a  compressible  gas  with  adiabatic  exponent  7  =  2  and  an  external  force  gphx  .  This  force 
vanishes  when  the  depth  is  uniform.  The  sound  speed  is 

c  =  =  \/g(i]  +  //.)•  (61) 

This  is  the  speed  of  a  small  disturbance. 

As  in  gas  dynamics,  the  solutions  of  the  nonlinear  shallow  water  equation  cease  to  be 
single  valued  at  a  finite  time  for  certain  initial  conditions.  They  can  be  made  single  val¬ 
ued  by  introducing  a  discontinuity,  called  a  “shock”  in  gas  dynamics,  and  a  “bore”  in  water 
waves.  See  Fig.  4.  Such  discontinuities  can  be  observed  in  some  rivers,  and  in  kitchen  sinks. 

Notes  by  Khachik  Sargsyan  and  Walter  Pauls. 
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Lecture  5:  Amplitude  Dynamics,  Boundary  Layers,  and 

Harbor  Resonance 

Jospeh  B.  Keller 


1  Amplitude  Amplification  at  the  Shore 

We  will  now  consider  the  consequences  of  ray  theory  for  the  amplification  of  wave  ampli¬ 
tudes  near  a  shoreline.  In  the  deep  ocean,  tsunami  waves  have  small  amplitude  and  long 
wavelength,  on  the  order  of  tens  of  kilometers.  But  when  they  reach  the  shore,  as  mentioned 
in  the  last  lecture,  they  can  grow  to  towering  heights.  We  can  gain  insight  into  the  nature 
of  this  amplitude  growth  using  the  linear  methods  of  the  previous  lecture. 

We  will  consider  the  evolution  of  a  wave  train  incident  on  a  sloping  beach.  Let  the  depth 
be  a  linear  function  of  distance  from  the  shore 


h  = 

=  x  tan  a  ~  ax ,  a<l 

(1) 

Following  the  method  of  the  previous  lectures,  factor  the  velocity  potential  into  a  horizontal 
oscillation  and  a  vertical  mode  shape 

<f>  (x,  y,  z) 

=  Ac,osh{k(z  +  h))ei0S{x'y). 

(2) 

Consequently, 

(VS1)2  =  k1 
k  tanh  kh  =  1. 

(3) 

(4) 

We  now  specialize  to  the  case  of  one  dimensional  long  waves.  These  are 
wavelength  compared  to  the  depth  and  wavefronts  parallel  to  the  shore, 
these  assumptions  imply 

waves  with  long 
Mathematically, 

kh  <  1 

5  =  S(x). 

(5) 

(6) 

In  this  limit,  Equation  (4)  becomes 

k2h  =  1, 

(7) 

which  implies 

k=  A=. 

\fh 

(8) 
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Therefore,  as  the  waves  approach  shore  and  h  — >  0,  k  — >  oo  and  the  wavelength  decreases 
to  zero.  The  parameter  kh.  however,  remains  small  throughout: 

kh  =  l  -►  0 

Incidentally,  the  one  dimensional  wave  approaching  parallel  to  the  shore  is  the  most 
physically  relevant  case.  Recall  from  Lecture  3  the  equation  for  the  path  of  characteristic 
curves 

£vs=£v«*(x).  (9) 

Using  n2(x)  =  /j.q//i(x),  wo  have 


ho 

h2(x) 


V/i. 


(10) 


Since  VS  is  parallel  to  rays,  and  — V//,  points  in  the  direction  of  the  shore,  this  equation 
shows  that  rays  curve  in  the  direction  of  the  shore.  Figure  1  illustrates  the  intuition  behind 
this  result.  As  a  wave  approaches  a  beach  at  an  angle,  the  section  of  the  wavefront  further 
from  the  beach  is  over  deeper  water,  and  therefore  has  a  relatively  faster  wave  speed.  Ac¬ 
cordingly,  the  wavefront  will  swing  towards  the  shore  until  it  is  parallel  with  the  beach,  and 
all  points  on  the  wavefront  have  the  same  wave  speed.  For  this  reason,  the  one  dimensional 
formulation  is  adequate  to  investigate  the  late  stages  of  a  wave’s  approach  to  the  shore. 

The  reduction  to  one  dimension  also  makes  it  easy  to  compute  the  phase.  We  have 


dS 

dx 


±/r 


± 


1 

Tfi 


± 


i 

\fx.  t  an  o 


(11) 


Though  the  derivative  is  singular  at  x  =  0,  it  is  integrable  and  we  can  compute  the  phase 


S(x) 


Jxo  y/h(x  ^ 

fx  j 


7a  o  tan  o 

2,.,e  +c 


dx! 


\/  tan  o 


(12) 


We  choose  the  minus  sign  to  be  consistent  with  waves  moving  toward  the  shore. 

We  are  now  prepared  to  compute  the  amplitude  evolution.  Because  the  rays  are  straight 
lines  in  one  dimension,  the  ray  tube  area  da  =  constant.  We  showed  in  Lecture  4  that 


Aq  (sinh2  kh  +  h)  kda  =  constant. 
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Arrives  parallel  to  shore 


Figure  1 :  Waves  on  a  sloping  beach  are  refracted  so  as  to  approach  with  wavefronts  parallel 
to  the  shoreline 
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Therefore, 


k  (h  +  sinh2  kh ) 
1 

k  ()t2/;,2  +  h) ' 


Using  equation  (  8),  we  have 


,  _  C' 
A°  fcl/4 


Thus,  the  theory  predicts  wave  amplitudes  going  to  infinity  at  the  shoreline.  That’s  why 
tsunamis  do  so  much  damage. 

This  result,  seems  natural  considering  that  we  have  held  the  example  of  a  tsunami  in 
mind  as  we  derived  it.  But  one  might  ask  what,  in  this  theory,  differentiates  tsunamis  from 
any  other  wave?  The  ocean  is  full  of  waves  satisfying  kh  1  that  reach  shore  without 
catastrophic  consequences.  Clearly,  something  is  wrong  with  the  theory  very  close  to  the 
shore. 

A  typical  trouble  with  asymptotic  theories  is  the  presence  of  certain  regions  where  the 
solutions  become  singular.  In  these  regions,  the  asymptotic  expansions  fail.  We  need  a 
different  theory  that  applies  in  this  singular  region,  which  we  call  a  boundary  layer.  If  all 
goes  well,  we  will  be  able  to  find  a  solution  that  applies  in  the  boundary  layer  and  that 
blends  continuously  into  exterior  solution  we  have  just  derived. 


2  Boundary  layer  and  shallow  water  equations 

2.1  The  concept  of  the  wave  boundary  layer 

The  asymptotic  analysis  based  on  the  linear  wave  theory  discussed  above,  which  was  asymp¬ 
totic  in  the  sense  that  the  depth  and  wavelength  were  small  compared  to  the  characteristic 
horizontal  scale,  proved  to  be  successful  for  deep  water  waves  (see  the  previous  lecture  and 
[3]).  However,  the  theory  fails  in  the  proximity  of  the  shore,  yielding  an  infinite  amplitude 
there.  Wave  propagation  near  the  shore  can  be  analyzed  by  means  of  the  shallow  water 
theory  and  the  boundary  layer  concept. 

The  idea  is  to  use  the  shallow  water  equations  in  the  vicinity  of  the  shore,  where  the 
depth  is  small,  and  the  wavelength  is  large  compared  to  the  depth.  It  is  worth  noting, 
however,  that  shallow  water  theory  is  applicable  over  large  regions  of  ocean  basins,  if  we 
study  phenomena  on  the  synoptic  scale,  large  ( L  &  100  km)  compared  to  the  mean  depth 
of  the  basin  ( H  %  500  -  1000  m),  see  e.g.  [4]. 

First  of  all,  we  will  ruminate  for  a  while  on  the  derivation  of  the  shallow  water  equations, 
formulated  for  an  incompressible  fluid  in  an  inertial  frame  -  Coriolis  acceleration  will  not 
be  included. 

Naturally,  at  this  point  anyone  merely  conversant  with  physical  oceanography  1  would 
ask  about  the  relevance  of  linear  shallow  water  theory  to  the  reality  which  occurs  on  the 
noninertial  frame  of  the  rot  ating  Earth,  where  the  the  Coriolis  acceleration  should  be  taken 

]and  wo  assume  t  hat  some  of  our  readers  belong  to  t  his  group 
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into  account.  In  fact,  this  effect  is  crucial  in  studies  of  tidal  phenomena  as  well  as  for 
the  more  general  model  of  Poincare  waves  (the  latter  problem  being  formulated  in  the 
shallow  water  approximation  in  the  synoptic-scale  ocean  gives  dispersive  waves).  However, 
neglecting  the  coriolis  force  may  be  justified  in  two  classes  of  wave  problems: 

•  waves  propagating  in  lakes  and  small,  shelf  seas  (i.e  the  Baltic  sea)  where  the  long 
(with  respect  to  the  basin  depth)  waves  are  locally  generated  by  a  strong  wind  yet 
they  are  short  enough  not  to  be  effected  by  the  Coriolis  acceleration, 

•  Shoaling  waves  approaching  normal  to  a  shoreline.  Then  the  waves,  irrespective  of 
whether  they  were  nondispersive  or  dispersive  far  from  the  shore  due  to  the  Coriolis 
effect,  undergo  shoaling  on  scales  on  which  the  Coriolis  effect  does  not  contribute. 


3  The  structure  of  the  boundary  layer 

While  toying  with  the  idea  of  the  horizontal  wave  boundary  layer  t)  for  the  problem 
of  waves  approaching  the  shore  and  affected  by  its  presence,  we  could  attempt  to  find  an 
analogy  with  the  vertical  terrestial  and  oceanic  boundary  layer  (table  1).  By  this  analogy, 
we  may  expect  that  the  solution  obtained  for  the  wave  boundary  layers  may  be  matched 
at  the  borderline  between  the  boundary  layer  and  the  outer  one,  just  like  in  the  case  of 
vertical  boundary  layers. 


3.1  Shallow  water  equations 

We  consider  long  waves  propagating  in  relatively  shallow  water  in  an  inertial  frame  of  ref¬ 
erence.  We  assume  two-dimensional  motion  in  the  (x,  y )  plane.  The  equation  of  continuity 
for  an  incompressible  fluid  is: 

ux  +  wz  =  0,  (15) 

The  kinematic  condition  and  the  dynamic  conditions  at  the  free  surface  are: 


(j)t  +  'ttTjx  tc)  \z==Tj  —  0,  p\Z—Tj  —  0  (16) 

The  kinematic  condition  at  the  bottom  is: 


(uhx  T  vj)\z=—h  —  0,  p\ z=rj  —  0 


(17) 


It  is  convenient  to  formulate  the  problem  in  terms  of  the  depth  integrated  horizontal  velocity, 
namely: 


A 

dx 


r 

J-h 


udz , 


using  the  boundary  conditions  and  the  Leibniz  rule  of  integration: 

dx 


/v 

-h 


udz 


~Vt 


(19) 


In  the  shallow  water  theory  the  hydrostatic  pressure  approximation  is  used.  That  is,  vertical 
acceleration  is  ignored.  Then,  the  pressure  at  a  point  is  determined  entirely  by  the  weight 
of  the  water  column  above  it: 


P  =  9P(71  -  z) 


(20) 
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vertical  (terrestial / ocean ) 

horizontal  (wave) 

ground /free  water  surf  ace/ice 

shoreline 

Prandtl  layer 

shore  wave  boundary  layer  5  where  the  shallow  water  theory 
applies;  for  the  sake  of  use  of  asymptotic  methods,  0  can  be 
further  divided  into  inner  shore  wave  boundary  layer  in 
the  immediate  proximity  of  the  shoreline  and  the  outer  shore 
wave  boundary  layer  D0 

Ekman  layer 

the  outer  layer  where  the  shallow  water  might  also  apply 
but  the  Coriolis  term  should  be  taken  into  account  causing 
waves  to  be  dispersive 

free  flow  (geostrophic  layer) 

the  outermost  layer  where  gravity  deep  water  wave  theory 
applies 

Table  1:  Boundary  layer  structure  and  the  analogy  between  vertical  and  horizontal  bound¬ 
ary  layers. 
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The  horizontal  pressure  gradient  is  then  px  =  gpr]x.  From  the  equation  of  motion  in  the  x 
direction,  ut  +  uux  =  —gr]x,  the  horizontal  acceleration  is  independent  of  depth.  Therefore 
so  is  u,  provided  that  it  was  initially  independent  of  z.  The  depth  integrated  u  is  now 
J^hudz  =  u(g  +  h).  Using  this,  we  obtain  the  nonlinear  shallow  water  equations: 

ut  +  uux  =  —ggx  (21) 

(u(h  +  g))x  —  —rjt,  (22) 

Where  g  =  g(x,t),  u  =  u(x,t),  h  =  h(x).  If  we  assume  that  u,  g  and  their  derivatives 
are  small,  their  products  can  be  neglected  compared  with  linear  terms.  Then  (21)  and  (22) 
yield  the  linear  shallow  water  equations  : 

ut  =  -grjx  (23) 


(uh)x  =  -gt 

Eliminating  rj  from  (23)  and  (24)  gives 

(uh)xx  = 
9 


(24) 

(25) 


3.2  Linear  SWE  and  the  variable  depth  -  asymptotic  approach 

Since  h  =  h(x)  is  independent  of  t)  we  can  rewrite  (25)  as 

(uh)xx  -  =  0.  (26) 

This  is  the  wave  equation  for  a  variable  U*  =  uh  with  propagation  velocity  c  =  y/gh{x). 
For  time  harmonic  waves  U*{x ,  t)  =  U(x)  exp(— iut),  (26)  becomes  the  Helholtz  equation: 

Uxx  +  -£tU  =  0.  (27) 

gh{x) 

We  now  define  k  =  u/yfgho,  n(x)2  =  gho/gh(x),  in  terms  of  a  typical  depth  ho-  Then  we 
can  rewrite  (26)  as 

Uxx  +  k2n2{x)U  =  0.  (28) 

Away  from  the  shoreline  h(x)  =  0,  the  asymptotic  form  of  U(x,k )  for  kho  »  1  is 

U(x)  w  Zin(x )  exp  (ik  Sin(x ))  +  Zr(x )  exp  ( ik  ST(x)).  (29) 

Here  Zin(x),  Sin(x)  and  Zr{x),  Sr(x)  are  the  amplitudes  and  phases  of  the  incident  and 
reflected  waves,  respectively.  We  call  (29)  the  outer  asymptotic  expansion  of  U.  It 
is  not  valid  where  h(x)  =  0  because  Zm(x),  Zr(x)  become  infinite  there.  To  determine 
U(x)  near  the  shore,  we  define  x'  =  kx  and  V(x',k)  =  U(x,k).  Then  h(x)  =  h ( ^ )  = 
h( 0)  +  hx(0)( y)  +  0(k~2).  Then  at  the  shoreline  x  =  0  we  get  h( 0)  =  0  and  we  define  a, 
the  slope  of  the  bottom,  by  tan  a  =  hx( 0).  Then  n(x)2  =  ho/h(x)  —  (kho)/(x'  tan  a)  +  0(1) 
and  (28)  becomes: 

Vx'x'+{-^-^  +  O(l))V  =  0.  (30) 

tan  a  x 
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When  we  neglect  the  0(1)  term,  (30)  becomes  a  form  of  the  Bessel  equation.  Although  the 
coefficient  of  V  is  singular  at  x(  =  0,  the  equation  has  a  solution  which  is  regular  there.  It 
is 

V(x',k)=AV^M  (31) 

Here,  A  is  an  arbitrary  constant.  Then  the  solution  of  (31)  vanishes  at  x*  —  0.  There  is 
also  a  solution  which  is  infinite  at  x 1  =  0.  The  asymptotic  solution  of  (31)  for  x 1  large  is: 


This  can  be  matched  with  the  outer  expansion  (29).  The  system  of  linear,  variable  depth 
shallow  water  equations  is  satisfactory  for  small  amplitude  waves.  It  does  not  capture  effects 
like  breaking,  for  which  the  nonlinear  theory  is  needed. 

4  Nonlinear  Wave  Propagation  Along  Rays 

In  these  lectures,  we  have  discussed  the  linear  theory  of  waves  in  some  detail.  It  would  be 
a  shame  not  to  discuss  nonlinearity  a  little  further.  For  decades,  models  of  water  waves 
have  been  an  interesting  source  of  nonlinear  equations.  One  of  the  most  famous  of  these 
equations  is  that  of  Kordeveg  and  de  Vries,  which  was  derived  to  model  the  cumulative 
effect  of  nonlinearity  in  water  waves  travelling  over  long  distances. 

In  this  section,  we  will  derive  KdV  in  the  context  of  a  ray  tracing  theory  for  nonlinear 
long  waves  on  a  layer  with  spatially  varying  depth.  In  linear  theories,  the  amplitude  typically 
takes  the  form  of  a  nearly  sinusoidal  wave  train  with  amplitude  and  wavenumber  slowly 
varying  along  a  ray.  Instead,  we  will  find  that  the  amplitude  is  governed  by  an  equation  of 
KdV  form.  Specifically,  we  will  consider  the  equations  for  a  disturbance  on  the  surface  of  an 
incompressible  flow  of  constant,  density  without  rotation.  This  computation  is  a  simplified 
presentation  of  a  more  general  analysis  presented  in  [5],  in  which  the  effects  of  bottom 
topography,  incompressibility,  rotation,  stratification,  and  a  polytropic:  equation  of  state 
are  taken  into  account.  By  including  only  one  dimensional  bottom  topography,  we  will  be 
examining  the  simplest  case  in  which  nontrivial  amplitude  dynamics  occurs. 

4.1  Scaling  the  Equations  of  Motion 

We  begin  by  introducing  a  carefully  chosen  scaling  of  the  equations  of  fluid  motion.  The 
key  method  of  asymptotic  analysis  is  to  rescale  equations  to  introduce  small  parameters, 
thus  allowing  complex  problems  to  be  considered  as  a  sequence  of  simpler  problems.  The 
art  of  this  method  is  to  tailor  one’s  scaling  to  access  a  physically  interesting  limit.  In  this 
case,  the  scaling  will  pertain  to  waves  with  wavelength  long  compared  to  the  depth  of  the 
layer,  propagating  over  long  distances. 

Let  us  consider  a  layer  of  incompressible  fluid  in  two  dimensions  bounded  above  by  the 
free  surface  z*  —  if  (.7:*,  /*)  and  below  by  the  rigid  surface  z*  =  —  h*  (x*).  Before  writing  the 
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equations  of  motion,  we  introduce  stretched  dimensionless  variables  x,z,t,  etc.,  as  follows: 


<  =  («  )2/3. 

/i*  -  Hh, 

p*  ---  gHpop , 

(u*,w*)  =  y/gH  (u,ell2w)  . 


(x*,z*)  =  H  (e-V2x,z) , 
V*  =  Hr], 

t*=e-3/2t(H.y/2 


(33) 


and  v  =  (u7w).  Here,  L  is  a  typical  horizontal  scale  of  variation,  so  e  is  determined  by  the 
characteristic  aspect  ratio  of  the  motion. 

In  these  stretched  variables,  the  equations  of  motion  take  the  following  form: 


dw  du 
dz  +  6  dx 


du  du  dp 
dt  U  dx  dx 


+  w 


du 


dw  dw 

~di +ufa 


)dw  dp 

+  ew-dl  +  ¥z 


dz 

+  1 


o, 

o, 

0. 


(34) 

(35) 

(36) 


The  kinematic  condition  and  the  normal  force  balance  at  the  free  surface  are,  respec- 
tively, 


dp 

di 


di] 

+  u¥ 


p 


w, 

C, 


(37) 

(38) 


evaluated  at  z  —  ?/(x).  The  kinematic  condition  at  the  lower  boundary  is 

w  =  —eu^-  (39) 

ox 

evaluated  at  z  =  —  h(x). 

When  e  =  0,  equations  (  34)  -  (  39)  have,  as  a  solution,  the  state  of  rest  given  by 

vo  =  0,  (40) 

po  =  C-z  (41) 

Vo  =  0.  (42) 


To  find  approximate  solutions  for  e  /  0,  we  introduce  a  phase  function  5  (x,  t)  and  the 
“fast”  variable  £  =  e-15'.  We  then  express  v,  p,  and  p,  as  functions  of  £  as  well  as  of  x,z,t, 
and  e.  We  also  assume  that  these  functions  posses  asymptotic  expansions  in  e  of  the  form 

v  (£,  t,  x,  z,  e)  ~  v0  (t,  x,  z)  +  eva  (£,  t,  x,  z)  +  e2v2  +  •  •  •  ,  (43) 


where  the  variables  with  subscript  0  are  the  rest  state  solutions  given  above.  Under  this 
change  of  variables,  the  derivatives  transform  as 


d_ 

dx 

d_ 

dt 


d_ 

dx 

d_ 

dt 


+  e~lS: 


d 


+  e 


xdt’ 

d 


(44) 

(45) 
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Since,  ||  =  0,  the  equations  of  motion  become 

0,  (46) 

0,  (47) 

0.  (48) 

The  boundary  conditions  become 

+  +  =  °'2  =  ’''  <«) 

dli 

w  +  «/.—  =  0,  2  =  —h.  (50) 


du  du  dp 
dt  U  dx  dx 


du  du  dw 

efc+bxdl  +  lh 
,  /  c  ,  C  \  du  .  o  dP  .  du 

+  {St  +  uSx)  —  +  Sx-  +  w— 


dw  dw 
~dt  +  Udx 


,  dw  _  .  dw\  dp 

+elwm  +  {S,  +  uS’)M)  +  te  + 


Wc  will  now  substitute  the  asymptotic  series  forms  into  these  equations  and  equate 
coefficients  of  successive  powers  of  e.  Additionally,  we  transform  the  boundary  conditions  at 
2  =  r)  into  boundary  conditions  at  z  =  0  by  writing  the  boundary  terms  as  a  taylor  expansion 
around  z  =  0.  In  this  way  we  obtain  sets  of  equations  for  the  successive  determination  of  5 
and  of  the  various  coefficients  in  the  asymptotic  expansion  of  the  solution. 

Equating  the  coefficients  of  order  e  yields 


du]  dwi 

x  dt  +  dz 

=  o, 

(51) 

,  du i  dpi 

‘  dt  +  dt 

=  0, 

(52) 

dpi 

dz 

=  0, 

(53) 

c  dm 

Si~K7  ~  u’l 

dt 

—  0,  z  0, 

(54) 

Pi 

=  Pi,  z  =  o, 

(55) 

Wl 

=  0,  z  —  —h. 

(56) 

4.2  Modes  Structure  and  the  Eiconal  equation 

We  can  solve  equations  (  51)-(  56)  for  the  structure  of  wave  solutions  at  leading  order. 
A  PDE  governing  the  phase  function  S  will  emerge  as  a  solvability  condition  for  these 
equations,  and  will  be  seen  to  be  equivalent  to  the  eiconal  equation  of  shallow  water  theory. 
First,  we  eliminate  ij\  and  U]  using  equations  (  51)  and  (  55)  to  write 


dip  _  dpi 

dt  dt' 

du\  _  1  dw\ 

dt  Sx  dz 


(57) 

(58) 
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Equality  in  equation  (  57)  holds  for  all  Z  by  equation  (  53).  The  system  simplifies  to 

St  dwi  dpi  _ 

S,  8z  +  ’  rl‘  ' 


-  °'2=0-  <61> 

w\  =  0 ,  z  =  —h.  (62) 

In  order  to  solve  these  equations,  we  seek  a  product  solution  of  the  form 

Pi  =  (63) 

8  A 

W\  =  -St—4>(x,z).  (64) 

These  forms  are  analogous  to  that  used  in  Lecture  3,  the  well  known  WKB  ansatz.  In 
those  cases  the  solution  consists  of  a  rapid  sinusoidal  oscillation  with  a  slowly  varying 
amplitude.  In  the  present  case,  we  also  imagine  the  solution  will  take  the  form  of  a  rapidly 
oscillating  waveform,  represented  by  A(£,£),  with  a  slow  modulation  and  vertical  structure 
represented  by  (f)  and  ?/>.  However,  because  of  nonlinearity,  the  fast  waves  do  not  take  the 
form  of  sinusoids.  Rather,  the  appropriate  wave  shape  will  emerge  from  the  analysis. 
Substituting  these  solution  forms  into  equations  (  59)-(  62)  and  simplifying  yields 

^  =  — 02(a:,  t)~,  (65) 


(f)-,  z  0, 
0,  z  —  — h. 


where 


02(x,  t) 


This  is  a  first  order  system  of  ordinary  differential  equations  in  2  in  which  x  and  t  appear 
only  as  parameters.  For  the  system  to  have  a  solution,  we  must  have 

■&  =  h.  (70) 

O  rp 


A  particular  solution  of  the  system  is  then 


w  i  = 


W)  (f + o . 


Sj  ip(x)  . 
Sx  h  ’ 
Axp(x), 

Atp(x). 
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(72) 

(73) 

(74) 


Note  that,  in  principle,  -0  niay  have  an  arbitrary  x  dependence,  and  u\  .  This  freedom 
corresponds  physically  to  the  fact  that  an  arbitrary  slowly  varying  order  e  height  field  could 
be  added  to  771 ,  and  an  arbitrary  order  e  velocity  field  U(x,z,t )  could  be  added  to  iq,  that 
would  have  to  be  balanced  only  at  higher  order  due  to  the  form  of  equations  (  34)-(  39). 
Since  we  are  not  interested  in  the  interaction  of  waves  with  higher  order  mean  flows,  we 
will  assume  U  =  0  and  ^  —  0. 

Equation  (  70)  is  the  same  as  the  eiconal  equation  computed  when  we  derived  shallow 
water  theory.  To  solve  this  equation  we  can  use  the  method  of  characteristics,  as  in  the 
previous  lectures.  It  is  interesting  to  note  that  the  modes  and  rays  we  have  computed  are 
the  same  as  those  determined  by  the  linear  theory  of  wave  propagation.  It  is  only  in  the 
determination  of  the  amplitude  A(£,t)  that  nonlinearity  plays  a  role,  and  to  that  we  now 
turn. 


4.3  Amplitudes 

To  determine  the  equations  governing  the  amplitude  function  we  must  analyze  the 

set  of  equations  obtained  by  equating  the  coefficients  of  order  e2  in  equations  (  34)-(  39). 
Doing  so,  we  obtain 


c  dU2 

dw2 

du\ 

s*ar+ 

dz 

9?  +  s, 

&P2 

dui 
*u  1  — — 

d£  3 

'  dt 

dt 

1  d£ 

1  to 

Q  dun 
~  Si  dt  ’ 

<J  dr)2 

>3*  “ST"  ' 

dt 

-  W2 

il 

1 

op  |  S3 

a  d'l  1 

bt  at 

dpi 

P‘2 

=  m  -  vi 

dz 

= 

dh 

-h. 

W2 

dx 

W\ 


du\  dpi 
d  z  dx  5 


0, 


dz  ’ 


(75) 

(76) 

(77) 

(78) 

(79) 

(80) 
(81) 


This  system  is  an  inhomogeneous  form  of  equations  (  51)-(  56),  with  forcing  given  by  the 
solutions  computed  at  lower  order.  Substituting  in  the  solutions  found  in  equations  (  71)- 
(  74)  yields 


St  dw2 
Sx  dz 


+ 


,  dp2 
'x  d£ 
dp2 


dz 

O  dP2 

6/-0T-  -  W2 

dC 


w2 


Gi(t,x,z,t), 

(82) 

(83) 

G3(£,x,t),z  =  0, 

(84) 

dh 

(85) 

U]ar’ 2  =  ~  l~ 
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where 


Gi  = 

s|g 

1 

</>►  | 

(86) 

g2  = 

(87) 

Gz  = 

.  (88) 

As  before,  we  seek  a  solvability  condition  for  this  system.  This  time,  the  condition  will 
impose  a  constraint  on  A  that  will  allow  us  to  solve  for  the  amplitude  along  rays. 

Begin  by  solving  (  83)  for  p2 


P2 


=  /  zG2dzf  +  P  (£,  x,  t) . 

J-h 


Inserting  this  solution  into  equation  (  82)  and  integrating  over  z  gives 

Si 


=-f />-!£/: 


dG2  ,  „  ,  ,  SldP 
■aT dz  dz  ~  + 


Applying  the  boundary  condition  at  z  —  —h  gives 

9P  St  dh 

St—  =  D  +  ——tpA. 


dz 


hST  dx 


Finally,  applying  the  boundary  condition  at  2  =  0  gives  the  constraint 


&  r  G  dj ,  %  f°  f ' 

TtLGidz  +  s;LL 


dG2 

di 


dz"dz'  =  -Gz  ~  St 


t 


dG2 

dt 


dz'  + 


St  dh 
hSx  dx 


(89) 


(90) 


(91) 


i>A.  (92) 


By  substituting  for  G\,  G 2,  and  Gz  into  (92),  and  making  extensive  use  of  equation  (  70), 
we  find  an  equation  for  A: 


dA 

~di 


d3A 

~dt 


(93) 


The  sign  of  the  right  hand  side  is  determined  by  the  branch  of  the  solution  to  the  eiconal 
equation  that  is  selected.  It  is  negative  for  rightward  travelling  waves,  and  positive  for 
leftward  travelling  waves. 

Equation  (  93)  is  of  KDV  form,  with  a  linear  growth  term  reflecting  the  expansion  and 
compression  of  ray  tubes  in  space-time.  Note  that 


1  dh  _  d 
2 Vh  dx  dx 


is  the  gradient  of  the  ray  speed,  and  thus  reflects  expansion  and  contraction  of  ray  tubes. 
For  a  rightward  travelling  wave  travelling  into  deeper  water,  ^  >  0,  ray  tubes  expand,  and 
equation  (  93)  predicts  the  decay  of  A  along  a  ray. 
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Figure  2:  The  configuration  of  the  cavity  resonator. 


Let  us  now  examine  the  case  of  constant  depth,  in  which  equation  (  93)  reduces  to  the 
form 


dA  dA  d^A 

~m~x'A-ai  +  X2W 


=  0 


(94) 


where  A]  and  A2  are  constants.  If  we  seek  travelling  wave  solutions  of  the  form  A(v ),  where 
v  —  £  —  ct,  we  find  the  ODE 


-cA' +  XiAA*  +  \2A"'  -0. 

One  periodic  solution  to  this  equation  is  the  Jacobi  function, 

A  =  crcri  (A-1  (£  —  ct))  . 

These  are  “cnoidal”  waves.  They  resemble  cosines,  but  with  flatter  troughs  and  sharper 
peaks.  Furthermore,  the  wavespced  c  is  a  function  of  the  amplitude  a. 

It  is  helpful  to  be  quite  clear  about  the  physical  picture  of  wave  propagation  that  has 
emerged  from  this  analysis.  Due  to  the  eiconal  equation  (  70),  surfaces  of  constant  £ 
propagate  with  the  shallow  water  wave  speed  \fh.  The  amplitude  equation  has  solutions 
that  propagate  relative  to  surfaces  of  constant  £.  Thus,  for  example,  the  full  propagation 
velocity  of  the  cnoidal  wave  solutions  above  is  v  =  VTi  +  c(cr).  It  is  possible  for  these 
nonlinear  disturbances  to  travel  at  supercritical  speeds. 

5  Closed  and  semiclosed  basins 

Asymptotic  methods  will  now  be  applied  to  determine  waves  in  semiclosed  basins  linked 
to  the  ocean  by  a  small  opening,  such  as  harbors  and  marinas,  subjected  to  the  wave  field 
incoming  from  the  open  ocean.  The  configuration  considered  here  is  a  semicircular  cavity, 
with  the  origin  at  the  centre,  as  shown  in  figure  2.  The  boundary  T  consists  of  a  circular  arc 
with  r  =  a  and  two  straight  lines.  The  opening,  of  half-width  s,  is  small  compared  to  the 


4fi 


radius  of  the  cavity.  The  system  is  a  two-dimensional  version  of  an  acoustic  Helmholtz 
resonator,  and  will  lead  to  harbour  resonance.  The  problem  may  be  stated  as  follows: 
given  a  prescribed  potential  at  infinity  (p^,  corresponding  to  a  plane  wave  incident  at  an 
angle  a,  that  is  4>oo  =  e*fc(IC0SQ:+i,sinQ:),  find  the  potential  (p(x,y)  satisfying: 


V20  +  k2  <j>  =  0 


dtp 

dn 


on  T 


<j)  ->  4>oo{x,y)  +  <ps 


(95) 


Here  (ps  is  an  outgoing  wave  potential.  The  problem  can  be  solved  numerically,  but  to  get  an 
analytic  insight  into  the  behavior  of  the  solution,  we  shall  adopt  the  asymptotic  approach 
presented  in  [2].  For  a  small  opening  ks  «  1,  the  disturbance  due  to  the  cavity  as  seen 
from  afar  is  that  of  a  point  source  of  some  strength  m.  Therefore,  we  can  write: 

(p-  (poo  -  (1/4 (kr).  (96) 

Expanding  the  Hankel  function  in  (96)  for  kr  — >  0: 

<f>  =  4> oo (0?  0)  -  (l/4)im[l  +  (2i/7r)  log((l/2)7fcr)]  +  0(mk2r 2  log(fcr)),  (97) 

where  log 7  =  0.577..  is  Euler  constant.  Note  that  0co(O,O)  —  1  and  define  <p°  =  1  — 
(l/4)im  +  (m/27r)  log((l/2)7&s).  Three  asymptotic  expansions  are  needed. 

1.  The  outer  expansion  (pouti  valid  in  the  infinite  region  away  from  the  opening.  The 
opening  appears  as  a  source  and  the  solution  is: 

(pout  ->  (p°  +  (m/2n)  log (r/s),  r/s  «  1,  (98) 


2.  The  potential  in  the  cavity: 

(pin {x,  y)  =  -m  <pc(x,  y).  (99) 

Here  cfrc  is  a  mode  of  the  closed  basin.  As  r  — »  0: 

<t>c  (2vr)_1  logr  +  const.  (100) 

^From  the  first  two  equations  in  (95),  the  solution  (f>c  is  given  by: 

4 (Pc  =  Y0(kr)  -  Jo(fcr).  (101) 


3.  A  potential  <pc  in  the  neighborhood  of  the  opening,  obtained  by  conformal  mapping 
(see  [2]). 

Matching  (pc  to  the  expansion  in  the  cavity,  and  to  that  outside  of  it,  gives  the  source 
strength  m  in  terms  of  the  conductivity  C.  The  source  strength  m  corresponds  to  the 
flux  through  the  opening,  given  by  m  =  C{<j>in  —  (p0ut ),  where  C  is  the  conductivity  of  the 
opening. 
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With  the  value  of  rn  determined,  the  value  of  the  potential  on  the  cavity  wall  is: 

m  2  (  1 

C  2kair  J'(ka)  irka  \Y'(ka)  +  J'(ka)  [i  -  |(l  +  log^fcs)] 

An  example  of  the  potential  response,  given  by  (102)  and  (99)  for  a  large  cavity  for  three 
different  opening  widths  (characterized  by  the  values  of  5)  is  presented  in  figure  (3a).  The 
peaks  of  the  response  occur  when  the  term  in  the  denominator  in  (102)  is  small.  Due  to 
the  oscillatory  behavior  of  the  Bessel  functions,  this  coincides  with  zeros  of  either  J\  ( ka )  = 
—  J(}(ko,)  or  Y\(ka)  ~  —Y^(ka.)  [1]  see  also  figure  (3b).  In  case  when  ka  is  small,  the  Yq  term 
dominates,  and  this  corresponds  with  the  highest  peak  in  figure  (3a),  which  defines  the 
Helmholtz  mode.  For  the  larger  values  of  ka,  the  Jq  term  takes  over  and  condition  Jq  =  0 
determines  the  position  of  the  natural  eigenmodcs  of  the  closed  cavity.  For  the  cavity  with 
small  opening,  the  response  is  modified  by  the  effect  of  the  —  log(7 ks)  term,  large  when  ks 
is  small.  We  can  also  observe  the  influence  of  the  opening  size  s :  reduction  in  the  size  s 
moves  the  Helmholtz  peak  to  smaller  frequency  and  increases  the  amplitude  of  the  response. 

These  results,  obtained  by  asymptotic  methods  were  tested  in  [2]  against  numerical 
solutions.  They  provide  a  theoretical  explanation  of  harbor  resonance,  a  phenomenon  of 
practical  importance  in  ocean  engineering. 

Notes  be  Alex  Hasha  and  Inga.  Koszalka. 
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Figure  3:  (a)  Response,  given  by  (102)  and  (99)  for  a  large  sector  cavity,  with  unit  source 
strength  m  =  1,  for  three  different  values  of  the  opening  width  s.  (b)  The  shape  of  the 
bessel  functions  —Ji(ka)  ~  JUka)  and  —  Yj(ka)  —  Y^(ka)  for  different  values  of  ka. 


Lecture  6:  Wiener  Process 


Eric  Vanden-Eijnden 


Chapters  6,  7  and  8  offer  a  (very)  brief  introduction  to  stochastic  analysis.  These 
lectures  are  based  in  part  on  a  book  project  with  Weinan  E.  A  standard  reference  for  the 
material  presented  hereafter  is  the  book  by  R.  Durett,  “Stochastic  Calculus:  A  Practical 
Introduction”  (CRC  1998).  For  a  discussion  of  the  Wiener  measure  and  its  link  with  path 
integrals  see  c.g.  the  book  by  M.  Kac,  “Probability  and  Related  Topics  in  Physical  Sciences” 
(AMS,  1991). 


1  The  Wiener  process  as  a  scaled  random  walk 


Consider  a  simple  random  walk  on  the  lattice  of  integers  Z: 

n 

^ n  =  (.ki  (1) 

A-l 


where  {£k}ke N  is  a  collection  of  indc])endent,  identically  distributed  (i.i.d)  random  variables 
with  P (£/,  =  ±1)  =  -.  The  Central  Limit  Theorem  (see  the  Addendum  at  the  end  of  this 
chapter)  asserts  that 


Xn_ 

y/N 


N{  0,1) 


(=  Gaussian  variable  with  mean  0  and  variance  1) 


in  distribution  as  N  — >  oc.  This  suggests  to  define  the  piecewise  constant  random  function 
WtN  on  t  G  [0,  oo )  by  letting 


WtN 


X  [TVtJ 


(2) 


where  [7V/J  denotes  the  largest  integer  less  than  Nt  and  in  accordance  with  standard  no¬ 
tations  for  stochastic  processes,  we  have  written  t  as  a  subscript,  i.e.  ~  WN(t). 

It  can  be  shown  that  as  N  —*  oo,  WtN  converges  in  distribution  to  a  stochastic  process 
termed  the  Wiener  process  or  Brownian  motion 1 ,  with  the  following  properties: 


(a)  Independence.  Wt  —  Ws  is  independent  of  {WT}r<fi  for  any  0  <  s  <  t. 

]Tho  Brownian  motion  is  termed  after  the  biologist  Robert  Brown  who  observed  in  ]827  the  irregular 
motion  of  pollen  particles  floating  in  water.  It  should  be  noted,  however,  that  a  similar  observation  had  been 
made  earlier  in  17G5  by  the  physiologist  Jan  Ingenhousz  about,  carbon  dust  in  alcohol.  Somehow  Brown’s 
name  became  associated  to  the  phenomenon,  probably  because  Ingenhouszian  motion  does  not  sound  very 
good.  Some  of  us  with  complicated  names  are  moved  bv  this  story. 
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Figure  1:  Realizations  of  Wj^  for  N  =  100  (blue),  N  =  400  (red),  and  N  =  10000  (green). 


(b)  Stationarity.  The  statistical  distribution  of  Wt+S  —  Ws  is  independent  of  s  (and  so 
identical  in  distribution  to  Wt). 

(c)  Gaussianity.  Wt  is  a  Gaussian  process  with  mean  and  covariance 

E Wt  -  0,  EWtWs  =  min (i,  a). 


(d)  Continuity.  With  probability  1,  Wt  viewed  as  a  function  of  t  is  continuous. 

To  show  independence  and  stationarity,  notice  that  for  1  <  m  <  n 

n 

Xn  Xrn  —  ^  ^ 

fc=m+ 1 

is  independent  of  Xm  and  is  distribute  identically  as  Xn-m.  It  follows  that  for  any  0  <  s  <  t, 
Wt  —  Ws  is  independent  of  Ws  and  satisfies 

Wt-Ws±  (3) 


where  =  means  that  the  random  processes  on  both  sides  of  the  equality  have  the  same 
distribution.  To  show  Gaussianity,  observe  that  at  fixed  time  t  >  0,  converges  as 
JV  — >  oo  to  Gaussian  variable  with  mean  zero  and  variance  t  since 


WtN 


X\Nt\  _  XlNt]  V[Nt} 
VN  _  y/[Nt\  VN 


N(0,l)Vt  =  N(0,t). 


In  other  words, 


where 


(5) 


p(x,t)  = 


e~x2/2t 

\f2nt 


In  fact,  given  any  partition  0  <  t\  <  t<i  <  •  •  •  <  tn,  the  vector  converges 

in  distribution  to  a  n-dimcnsional  Gaussian  random  variable.  Indeed,  using  (3)  recursively 
together  with  (4), (5)  and  the  independence  property  (a),  it  is  easy  to  see  that  the  probability 
density  that  (Wu , . . . ,  WUl )  =  (ari,  —  ,  xn)  is  simply  given  by 


p(xn  3'n  —  litr 
A  simple  calculation  using 

EWt  =  f  xp(x,  t)dx , 


tn—\)  *  *  *  P{%2  -  -h)p(xUti) 


EWtWs  =  /  yxp(y  -  x,  t  -  s)p(x ,  s)dxdy. 

Vr2 


(6) 


for  t  >  s  and  similarly  for  t  <  s  gives  the  mean  and  covariance  specified  in  (b).  Notice  that 
the  covariance  can  also  be  specified  via 

E(Wi  -  Wsf  =  1 1  -  s\, 

and  this  equation  suggests  that  Wt  is  not  a  smooth  function  of  t.  In  fact,  it  can  be  showed 
that  even  though  Wt  is  continuous  almost  everywhere  (in  fact  Holder  continuous  with 
exponent  7  <  1/2),  it  is  differentiable  almost  nowhere.  This  is  consistent  with  the  following 
property  of  self-similarity:  for  A  >  0 


Wt 


.  &  \-'nwu, 

which  is  easily  established  upon  verifying  that  both  Wt  and  \~~l^2Wxt  are  Gaussian  processes 
with  the  same  (zero)  mean  and  covariance. 

More  about  the  lack  of  regularity  of  the  Wiener  process  can  be  understood  from  first 
passage  times .  For  given  a  >  0  define  the  first  passage  time  by  Ta  =  inf {t  :  Wt  ~  a}.  Now, 
observe  that 

P {Wt  >  a)  =  P (T„  <  t  &  Wt  >  a)  =  £P (Ta  <  t).  (7) 

The  first  equality  is  obvious  by  continuity,  the  second  follows  from  the  symmetry  of  the 
Wiener  process;  once  the  system  has  crossed  a  it  is  equally  likely  to  step  upwards  as  down¬ 
wards.  Introducing  the  random  variable  Mt  =  sup0<s<t  ITS,  we  can  write  this  identity 
as: 

roc  p-z2/2t 


P {Mt  >  a)  =  P (Ta  <t)  =  2P (Wt  >  a 


)-/ 

J  a 


\Z2nt 


■ dz , 


(8) 


where  we  have  invoked  the  known  form  of  the  probability  density  function  for  Wt  in  the 
last  equality.  Similarly,  if  mt  =  info<s<t  Ws , 


(m.t  <  —a)  —  P {Mt  >  a). 


(9) 


But  this  shows  that  the  event  uWt  crosses  a”  is  not  so  tidy  as  it  may  at  first  appear  since 
it  follows  from  (8)  and  (9)  that  for  all  e  >  0: 


P (Me  >  0)  >  0  and  P(me  <  0)  >  0. 


(10) 


In  ])articular,  t  =  0  is  an  accumulation  point  of  zeros:  with  probability  1  the  first  return 
time  to  0  (and  thus,  in  fact,  to  any  point,  once  attained)  is  arbitrarily  small. 
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2  Two  alternative  constructions  of  the  Wiener  process 

Since  Wt  is  a  Gaussian  process,  it  is  completely  specified  by  it  mean  and  covariance, 


EWt  =  0  EWtWs  =  mm(t,s). 


(11) 


in  the  sense  that  any  process  with  the  same  statistics  is  also  a  Wiener  process.  This 
observation  can  be  used  to  make  other  constructions  of  the  Wiener  process.  In  this  section, 
we  recall  two  of  them. 

The  first  construction  is  useful  in  simulations.  Define  a  set  of  independent  Gaussian 
random  variables  {rjk}ke N?  each  with  mean  zero  and  variance  unity,  and  let  {(f>k{t)}ke N  be 
any  orthonormal  basis  for  L2[0, 1]  (that  is,  the  space  of  square  integral  functions  on  the  unit 
interval).  Thus  any  function  f(t )  in  this  set  can  be  decomposed  as  f(t)  =  Ylke 
where  (assuming  that  the  (j>k  s  are  real)  =  /q  f(t)<j>k{t)dt.  Then,  the  stochastic  process 
defined  by: 

Wt  =  Yvk  f  (12) 

ken  Jo 

is  a  Wiener  process  in  the  interval  [0,1].  To  show  this,  it  suffices  to  check  that  it  has 
the  correct  pairwise  covariance  -  since  Wt  is  a  linear  combination  of  zero  mean  Gaussian 
random  variables,  it  must  itself  be  a  Gaussian  random  variable  with  zero  mean.  Now, 


EWtWs  =  V  EtjkVi  f  <t>k{t')di'  [*  Ms'W 

k,ien 

=  T  [  k{t')dt'  [  (pk{s')ds\ 

JO  J  0 


(13) 


where  we  have  invoked  the  independence  of  the  random  variables  To  interpret  the 

summands,  start  by  defining  an  indicator  function  of  the  interval  [0,  r]  and  argument  t 


Xr{t) 


1  if  t  6  [0,  r] 
0  otherwise. 


If  r  6  [0, 1],  then  this  function  further  admits  the  series  expansion 

Xr(t)  =  ZMt)[  (t>k{t')dt'.  (14) 

k  J° 

Using  the  orthogonality  properties  of  the  {^(t)},  the  equation  (13)  may  be  recast  as: 


EWtWs=V]  [  (  f 

k,ien^°  ^ 0 

=  /  Xt(u)Xs(u)du 

Jo 

=  /  Xmin (t,s)(u)du  =  mm(t,s) 

Jo 


4>i(s')ds' <f>i{u)  )  du 


(15) 
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as  required. 

One  standard  choice  for  the  set  of  functions  {<f>k(t)}  is  the  Haar  basis.  The  first  function 
in  this  basis  is  equal  to  1  on  the  half  interval  0  <  t  <  1/2  and  to  -1  on  1/2  <  t  <  1,  the 
second  function  is  equal  to  2  on  0  <  t  <  1/4  and  to  -2  on  1/4  <  t  <  1/2  and  so  on.  The 
utility  of  these  functions  is  that  it  is  very  easy  to  construct  a  Brownian  bridge :  that  is  a 
Wiener  process  on  [0, 1]  for  which  the  initial  and  final  values  are  specified:  Wo  =  W\  =  0. 
This  may  be  defined  by: 

Wt  =  Wt-tWly  (16) 

if  using  the  above  construction  then  it  suffices  to  omit  the  function  4>\{t)  from  the  basis. 

The  second  construction  of  the  Wiener  process  (or,  rather,  of  the  Brownian  bridge),  is 
empirical.  It  comes  under  the  name  of  Kolmogorov- Smirnov  statistics .  Given  a  random 
variable  X  uniformly  distributed  in  the  unit  interval  (i.e.  P(0  <  X  <  x)  =  x),  and  data 
{Xl,-X2,  . . .  Xn },  define  a  sample-estimate  for  the  probability  distribution  of  X : 

\  1  ™ 

Fn{x)  =  -(number  of  Xk  <  x,  k  =  l,...,n)  =  -Y]x(-oo,x)(^k),  (17) 

k= l 

equal  to  the  relative  number  of  data  points  that  lie  in  the  interval  Xk  <  x .  For  fixed  x 
Fn(x)  —*  x  as  n  — >  oo  by  the  Law  of  Large  Numbers  tells  us  that,  whereas 

y/n(Fn(x)  —  x)  ^  N( 0,x(l  —  x)).  (18) 

by  the  Central  Limit  Theorem.  This  result  can  be  generalized  to  the  function  Fn  :  [0, 1] 

[0, 1]  (i.e.  when  x  is  not  fixed):  as  n  — »  oc 

V^(Fn{x)  ~x)±Wx-  xWx  -  Wx.  (19) 

3  The  Feynman-Kac  formula 

Given  a  function  /(x),  define 

u(x,t)=Ef(x  +  Wt)  (20) 

This  is  the  Feynmnn-Kac  formula  for  the  solution  of  the  diffusion  equation: 

To  show  this  note  first  that:  1 

u{x,t  +  s )  =  E/(x  +  Wi+S)  =  E /(*  +  (Wt+S  -  Wt)  +  Wt) 

=  Eu(x  +  Wt+S  —  Wt,  t)  =  E  u(x  +  Ws,  t) 

where  we  have  used  the  independence  of  Wt+S  —  Wt  and  Wt-  Now,  observe  that 
dii  .  1  , 

-r -lx,t)  =  lim  -  (u(x,t  +  s)  —  u{x,t)) 
ot  s— *04-  s 

=  lim  -E(u(x  +  Ws,t)  —  u(x,t)) 

s— »0-f  S 

=  A  ~s  (&(X'()EM/*  +  2M{X'mW’  +  °(s))  ’ 
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where  we  have  Taylor-series  expanded  to  obtain  the  final  equality.  The  result  follows  by 
noting  that  E Ws  =  0  and  EW2  =  s. 

The  formula  admits  many  generalizations.  For  instance:  If 

v(x,  t )  =  E f(x  +  Wt)  +  E  f  g(x  +  Ws)ds,  (22) 

Jo 


then  the  function  v(x,t)  satisfies  the  diffusion  equation  with  source-term  the  arbitrary 
function  g(x ): 

dv  1  d2v  .  .  .  . 

+  g(x)  v{x,0)  =  f{x). 


Or:  If 


dt  2  dx2 

f(x  +  Wt)  exp  (  J  c(x  +  Ws)ds 

then  w(x,t )  satisfies  diffusive  equation  with  an  exponential  growth  term: 

dw  1  d2w 


w(, 


x,t)  =  E 


dt  2  dx 2 


+  c(x)w  w(x,0)  =  f(x)  . 


(23) 

(24) 

(25) 


Addendum:  The  Law  of  Large  Numbers  and  the  Central  Limit 
Theorem 


Let  {Xj}jef$  be  a  sequence  of  i.i.d.  (independent,  identically  distributed)  random  variables, 
let  rj  =  EX  j  a 2  =  var(Xi)  =  E(Zi  —  rf)1  and  define 

n 

Sn  ~  Xj 

j= 1 

The  (weak)  Law  of  Large  Numbers  states  that  if  E|-Xy|  <  oo,  then 

—  — ►  r]  in  probability. 

The  Central  Limit  Theorem  states  that  if  E Xj  <  oo  then 

_ ^  — >  jV(0, 1)  in  distribution. 

v  ncr2 

We  first  give  a  proof  of  the  Law  of  Large  Numbers  under  the  stronger  assumption 
that  E|Xj|2  <  oo.  Without  loss  of  generality  we  can  assume  that  r\  =  0.  The  proof  is 
based  the  Chebychev  inequality :  Suppose  X  is  a  random  variable  with  distribution  function 
F(x)  =  ¥(X  <  x).  Then,  for  any  A  >  0, 

P(l*|  >  A)  <  Le|XP\  (26) 

provided  only  that  E|X|P  <  oo.  Indeed: 

ApP(|X|  >  A)  =  Ap  [  dF(x )  <  [  \x\pdF(x)  <  [  \x\pdF(x)  =  E|X|P. 

J\x\>\  J  |x|>a  Jr 
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Using  Chebychev’s  inequality,  we  have 


for  any  e  >  0.  Using  the  i.i.d.  property,  this  gives 

E|Sn|2  =  E\Xi  +  X2  +  . . .  +  Xn\2  =  nE|X]|2. 


Hence 


5. 


n 


>e!>  <  —E\Xi\2^0, 
nez 


as  n  — >  oo,  and  this  proves  the  law  of  large  numbers. 

Next  we  prove  the  Central  Limit  Theorem.  Let  /  be  the  characteristic  function  of  X\, 
i.e. 

f(k)  =  EeikXi ,  ke  R.  (27) 


and  similarly  let  gn  be  the  characteristic  function  of  Sn/ y  no"2.  Then 


gn(()  =  Ee^Sn/y/^  =  jj  Ee^/v/^  = 

3  =  1 

=  (1  +  ^EA''-2^EX'2  +  <,(A'_1))" 
—*  e  k  ^  as  n  — >  oo. 


This  shows  that  the  characteristic  function  of  Sn/V ncr2  converges  to  the  the  characteristic 
function  of  7V(0, 1)  as  n  — *  oo  and  terminates  the  proof. 

It  is  instructive  to  note  that  the  only  property  of  Xi  that  we  have  required  in  the 
central  limit  theorem  is  that  EX2  <  oo.  In  particular,  the  theorem  holds  even  if  the  higher 
moments  of  X\  are  infinite!  For  one  illustration  of  this,  consider  a  random  variable  having 
probability  density  function 


p{x) 


2 

7T ( 1  +  X2)2  ’ 


(28) 


for  which  all  moments  of  order  higher  than  2  are  infinite.  Nevertheless,  we  have: 


f(k)=  I  eik*p(x)dx  =  ( l  +  |fc|)e-|fe| 

J  R 

=  1  —  +  o(A:2), 

and  hence  the  Central  Limit  Theorem  applies.  Intuitively,  the  reason  is  that  the  fat  tails  of 
the  density  p(x)  disappear  in  the  limit  owing  to  the  rescaling  of  the  partial  sum  by  1/sqrtn. 


Notes  by  Marcus  Roper  and.  Ravi  Srinivasan. 
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Lecture  7:  Stochastic  integrals  and  stochastic  differential 

equations 

Eric  Vanden-Eijnden 


Combining  equations  (1)  and  (2)  from  Lecture  6,  one  sees  that  satisfies  the  recur¬ 
rence  relation 

WtNn  =  WtNn  +  U+i'/Ai,  W?  =  0.  (1) 

where  tn  =  n/N ,  At  =  l/N  and  {£n}neN  are  i.i.d.  random  variables  taking  values  ±1  with 
probability  5  as  before.  A  natural  generalization  of  this  relation  is 

*£+1  =  K  +  b(XtNn,  tn)At  +  a(XtNn,  tn)Zn+1VAi,  Xo  =  x  ■  (2) 

If  the  last  term  were  absent,  this  would  be  the  forward  Euler  scheme  for  the  ordinary 
differential  equation  (ODE)  Xt  =  b(Xt,t).  If  b(x,t)  and  e(x,t)  meet  appropriate  regularity 
requirements,  it  can  be  shown  that  X^  converges  to  a  stochastic  process  Xt  as  N  — >  oc 
(i.e.  as  At  — >  0  with  nAt  — >  t).  The  limiting  equation  for  Xt  is  denoted  as  the  stochastic 
differential  equation  (SDE) 

dXt  =  b(Xut)dt  +  a(Xut)dWu  X0  =  x,  (3) 

as  a  remainder  that  the  last  term  in  (2)  divided  by  At  does  not  have  a  standard  function  as 
limit.  The  notation  dWt  comes  from  (1)  since  this  equation  can  be  written  as  W^+1  —  = 

£n+1  y/At.  We  note  that  the  convergence  of  X^  to  Xt  holds  provided  only  that  the  £n’s  are 
i.i.d.  random  variables  with  mean  zero,  E£n  =  0,  and  variance  one,  E£^  =  1.  The  standard 
choice  in  numerical  schemes  is  to  take  =  iV(0, 1),  in  which  case 

y/Aiu+i  =  Wtn+l-Wtn. 

In  the  discussion  below,  however,  we  will  stick  to  the  choice  where  {£n}neN  are  i.i.d.  random 
variables  taking  values  ±1  with  probability  \  since  it  facilitates  the  calculations. 

Next,  we  study  the  properties  of  Xt  solution  of  (3)  and  introduce  some  nonstandard 
calculus  due  to  Ito  to  manipulate  this  solution. 

1  Ito  isometry  and  Ito  formula 

Consider  the  recurrence  relation 

XL,  =<  +  /«)«„+iVS,  %o  =  0. 
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Let  us  investigate  the  properties  of  the  limit  of  X^At  as  TV  — ►  oo,  assuming  that  this  limit 
exists.  The  limiting  form  of  the  recurrence  relation  above  is  traditionally  denoted  as 

dXt  =  f(Wut)dWu  Xo  =  0, 

which  can  also  be  expressed  as  the  stochastic  integral 

Xt  =  f  f{Ws,s)dWs. 

Jo 

Stochastic  integral  have  special  properties  called  the  ltd  isometry 

E  [*  f{W„8)dWa  =  0, 

Jo 

E (/  f(Ws,s)dWsy  =  J  E f2(Ws,s)ds. 

The  first  of  these  identity  is  often  written  and  used  in  differential  form 

Ef(Ws,s)d,W$  =  0. 

The  Ito  isometry  is  easy  to  demonstrate.  The  first  identity  is  implied  by 

n— 1 

E XtNn  =  E  Y 

7/1=0 

n— 1 

=  Y,  Ef(W£,tm)E£m+l\/Ai  =  0, 

777=0 

wliere  we  used  the  independence  of  the  £m’s  and  E£m  =  0.  The  second  identity  is  implied 
by 

71 

E(X»)2  =  E  Y  f(Wtl  >  tm)f(Wg ,  *p)£m+l£p+l  At 

777 ,7>=0 
71 

=  YE  f2(WZ,tm)At, 

777  =0 

where  we  use  the  fact  that  and  are  independent  unless  m.  =  p,  and  —  1  by 
definition. 

Going  back  to  (3),  a  very  important  formula  to  manipulate  the  solution  of  this  equation 
is  ltd  formula  which  states  the  following.  Assume  that  Xt  is  the  solution  of  (3)  and  let  / 
be  a  smooth  function.  Then  g{Xt)  satisfies  the  SDE 

d,g(Xt)  =  g'{Xt)dXt  +  y'{Xt)a2{Xut)dt 

=  {g\Xt)b{Xt^)  +  y\Xt)o2{Xut))dt  +  g'{Xt)o{Xut)dWt. 

If  g  depends  explicitly  on  then  an  additional  term  dg/dtdt  is  present  at  the  right  hand- 
side.  Ito  formula  is  the  analog  of  the  chain  rule  in  ordinary  differential  calculus.  However 
ordinary  chain  rule  would  give 

dg(Xt)=g'(Xt)dXt. 
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Here  because  of  the  non-differentiability  of  Xt)  we  have  the  additional  term  that  depends 
on  g"(x). 

The  proof  of  Ito  formula  can  be  outlined  as  follows.  We  Taylor  expand  s(X^+1 )  —  g(X^ ) 
using  the  recurrence  relation  (2)  for  Xj^  and  keep  terms  up  to  O(At): 

g(K+J  -  9«) 

=  g'(xO(x"+>  ~ K)  +  ~ O2  +  ■■■ 

+  \g"{X»)(b{X^tn)At  +  a(XtNn,tn)(n+1^y  +  0{At^) 

=  g'(X?n)(XtNn+i  -  XtNJ  +  y\XtNn)a2(X^tn)en+1At  +  O(AtW) 

=  g'(X?n)(X»+}  -  XQ  +  y'{X”)c\X?n,tn)At  +  0(At3/2), 

where  in  the  last  equality  we  used  =  1.  The  Ito  formula  follows  in  the  limit  as  At  — »  0. 


2  Examples 

The  Ito  isometry  and  the  ltd  formula  are  the  backbone  of  the  ltd  calculus  which  we  now  use 
to  compute  some  stochastic  integrals  and  solve  some  SDEs.  As  an  example  of  stochastic 
integral,  consider 

W$dWs . 

Taking  fix)  =  x 2  in  Ito  formula  gives 

\dW?  =  WtdWt  +  \dt . 

Therefore 

Wsd,Ws  =  \Wt  -  \t. 

Notice  that  the  second  term  at  the  right  hand-side  would  be  absent  by  the  rules  of  standard 
calculus.  Yet,  this  term  must  be  present  for  consistency,  since  the  expectation  of  the  left 
hand-side  is 

E  [  WsdWs  =  0, 

Jo 

using  the  first  Ito  isometry,  and  the  expectation  of  the  right  hand-side  is  zero  only  with  the 
term  7>t  included  since  ^E Wj*  =  \t. 

As  a  first  example  of  SDE,  consider 

d,Xi  —  — 'yXfdt  T  crdkU,  Xq  =  x 

This  is  the  Omstein-  Uhlenbeck  process.  Using  Ito  formula  with  f(x,t)  =  ellx,  we  get  (this 
is  Duhammel  principle) 


d(eytXt)  -  le^Xtdt  +  elldXt  =  ae^dWt. 
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Integrating  gives 

Xt  =  e-^x  +  <7  f  e~^~a)dWs. 

Jo 

This  process  is  Gaussian  being  a  linear  combination  of  the  Gaussian  process  W*.  Its  mean 
and  variance  are  (using  the  Ito  isometry) 

EXt  =  e~^x 

E{Xt  -  EXt)2  =  a2  /  (e_7(t_s))2ds  =  —  (1  -  e~27t). 

Jo  27 


Xt—*N  0 


Thus  when  7  >  0 


as  t  — >  00. 

As  a  second  example  of  SDE,  consider  the  so-called  geometric  Brownian  motion 


dYt  =  Ytdt  +  aYtdWu  Y0  =  y. 

This  process  has  some  application  in  mathematical  finance.  Ito’s  formula  with  f(x )  =  logx 
gives 

d\ogYt  =  ~(Ytdt  +  aYtdWt)  -  -—a^dt. 

Integrating  we  get 

Yt  =  y(^ah+oWl . 

Note  that  by  the  rules  of  standard  calculus,  we  would  have  obtained  the  wrong  answer 

Yt  =  yet+oMft  (wrong!) 


GO 


Indeed  the  term  —  ^a2t  in  the  exponential  is  important  for  consistency  since  taking  the 
expectation  of  the  SDE  for  Yt  using  the  first  Ito  isometry  gives 

dEYt  =  E  Ytdt, 


and  hence 

EYt  =  ye\ 

The  solution  above  is  consistent  with  this  since 

'EeaWt  =  ei°2t. 


3  Generalization  in  multi-dimension 

The  definition  of  Ito  integrals  and  SDE’s  can  be  extended  to  multi-dimension  in  a  straight¬ 
forward  fashion.  The  SDE 

K 

dX{  =  bj{Xu  t)dt  +  £  ajk(Xt,t)dWtk,  j  = 

k- 1 

where  {Wk)l Lj  are  independent  Wiener  processes,  defines  a  vector- valued  stochastic  process 
Xt  =  . . .  ,-X/).  The  only  point  worth  noting  is  the  Ito  formula,  which  in  multi¬ 

dimension  reads: 

if(xt)='£aJ^ddxi+\  £  t))dt 

j= 1  i  j,j’= 1  3  3  k= 1 

4  Forward  and  backward  Kolmogorov  equations 

Consider  the  stochastic  ODE 

dXt  =  b(Xt)dt  +  o{Xt)dWu  X0  =  y. 

Define  the  transition  probability  density  p(x,t\y)  via 

rx  2 

/  =  P{Xt+s  €  [xlta;2)|Xs  =  y}. 

Jx  i 

(p(rr,  t |y)  does  not  depends  on  s  because  b(x )  and  a(x)  are  time-independent.)  The  tran¬ 
sition  probability  density  is  an  essential  object  because  the  process  Xt  is  Markov ,  in  other 
words:  for  any  s  >  0 

P(X*+S  e  B[xlyx2)\{XS'}0<s,<s})  =  f>(Xt+s  G 

i.e.the  future  behavior  of  Xt  given  what  has  happened  up  to  time  s  depends  only  on  what 
Xs  was.  We  will  derive  equation  for  p.  Let  /  be  an  arbitrary  smooth  function.  Using  Ito 
formula,  we  have 

f{Xt)  -  f(y)  =  f  nxs)dxs  +  i  T  f"(Xa)a(X8)ds, 

Jo  Jo 


61 


where  a(.x)  =  o2(x).  Taking  expectation  on  both  sides,  we  get 


Ef(Xt)  -  f(y)  =  E  /  f'(Xs)b(Xs)ds  +  IE  /  f"(Xa)(X,)d8. 


or  equivalently  using  p 


[  f(x)p{x,t\y)dx-  f(y) 
Jr 


f'(x)b(x)p(x,s\y)dxds  +  \  /  /  f"(x)a(x)p(x,  s\y)dxds 


Since  this  holds  for  all  smooth  /,  we  obtain 


dp  _  _  d_ 
dt  dx 


with  the  initial  condition  lim*_>op(x,i|y)  =  <5(x  —  y).  This  is  the  forward  Kolmogorov 
equation  for  p  in  terms  of  the  variables  (x,t).  It  is  also  called  the  Fokker-Planck  equation . 

Equivalently,  an  equation  for  p  in  terms  of  the  variables  (y,  t)  can  be  derived.  The 
Markov  property  implies  that 

p(x,t  +  s\y)  =  /  p(x,t\z)p(z,s\y)dz. 

Jr 

Hence 

p{x,t  +  At\y)  -  p{x,t\y)  =  /  p(x,t\z)p(z,  At\y)dz  -  p(x,t\y) 

J  R 

=  [  p(x,t\z)(p(z,At\y)-S(z-y))dz. 

Ju 

Dividing  both  side  by  At  and  taking  the  limit  as  At  — ►  0  using  the  forward  Kolmogorov 
equation  one  obtains 

|f  =  f  p(x,t\z)[-J;m6(z  -  y))  +  lJP{a(z)5{z  -  y)))dz , 
which  by  integration  by  parts  gives 

This  is  the  backward,  Kolmogorov  equation  for  p  in  terms  of  the  variables  (y,  t).  The  operator 

L  =  b(y)^  +  ia(v)-^, 

is  called  the  mfinitesimnl  generator  of  the  process.  The  coefficient  b  and  a  can  be  expressed 
as 

Kv)  =  J™j(EyX(  -  y),  a{y )  =  Jim  jEy{Xt  -  y)2, 
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Figure  2:  Snapshots  of  the  density  of  the  Ornstein-Uhlenbeck  process  at  time  t  =  0.01 
(blue),  t  =  0.1  (red),  t  =  1  (green),  and  t—  10  (magenta).  Here  Xq  =  y  =  1  and  7  =  <7  =  1. 
The  last  snapshot  at  t  =  10  is  very  close  to  the  equilibrium  density. 


where  Ey  denotes  expectation  conditional  on  Xo  =  y , 

Both  the  forward  and  the  backward  equations  can  be  considered  with  different  initial 
conditions.  In  particular,  given  a  smooth  function  /,  if  we  define 


u(y,t)  =  E  yf(Xt), 

then  u(y,t)  =  / (x)p(x,  t\y)  and  hence  it  satisfies 

du  , ,  .  du  1  /  x  d2u 

m  =  bMdy  +  ia{y)W’ 

with  the  initial  condition  w(y,0)  =  /(y).  In  this  sense,  the  SDE  for  Xt  is  the  characteristic 
equation  that  is  associated  with  this  parabolic  PDE,  much  in  the  same  way  as  the  ODE 
Xt  =  b(Xt)  is  the  characteristic  equation  associated  with  the  first  order  PDE  du/dt  = 
b{y)du/dy.  This  can  be  generalized  in  many  ways.  For  instance,  the  solution  of 

dv  /  \  /  \  , ,  ,dv  ,  /  d2v 

—  =  c{y)v(y)  +  %)—  +  2a(y)g^2- 


with  the  initial  condition  u(y,  0)  =  /(y),  can  be  expressed  as 

v(y,t)  =  Eyf(Xt)eti<x^ds. 


This  is  the  celebrated  Feynman-Kac  formula  in  the  context  of  SDEs. 

Let  us  consider  an  example.  The  forward  differential  equation  associated  with  the 
Ornstein-Uhlenbeck  process  introduced  in  the  last  section  is 


dp 

dt 


7to(lA)  + 


a2  d2p 

Hdx2 
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The  solution  of  this  equation  is 


pOM \y) 


_ 1 _ pvnf  l(x-ye  7<)2^ 

v/7rcr2  ( 1  -  e~^)ff  exp  '  cr2  ( 1  -  e~2'>'t)  / 


This  shows  that  the  Ornstein-Uhlenbeck  process  is  a  Gaussian  process  with  mean  ye 
and  variance  cr2(l  —  e“27*)/27*  It  also  confirms  that  this  process  tends  to  jV(0, a2 /2^) 
t  —>  oo  since 


p(x)  =  lim  p(x,t\y )  = 

t — *00 


e-7x2/a2 

y/noy 7 


-7* 

as 


Generally,  the  limit  of  as  t  — ►  oo,  when  it  exists,  gives  the  equilibrium  density  p  of 

the  process.  It  satisfies 

0  =  +  \^a<<x)p)' 

Forward  and  backward  Kolmogorov  equations  can  also  be  derived  for  multi-dimensional 
processes.  They  read  respectively 


dp 

dt 


E  +  \  E 

j=l  3  J\j'= 1  ^ 


and 


dp  } 

w  =  E6 

j=i 


dp 


'iyX,dT„ 


c?2p 


jd'=i 


°^,wax*ac<’ 


where  =  Ea=i  ojk{x)ayk(x). 


Notes  by  Walter  Pauls  and  Arghir  Dani  Za,m.escu. 
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Lecture  8:  Asymptotic  techniques  for  SDEs 

Eric  Vanden-Eijnden 


Here  we  discuss  techniques  by  which  one  can  study  SDEs  evolving  on  very  different 
time-scales  and  derive  closed  equations  for  the  slow  variables. 


1  The  case  of  stiff  ordinary  differential  equations 


We  start  with  an  ODE  example.  Consider 

Xt  —  ~Yt  +  sin(7ri)  +  cos(v/27r  t)  Wo  =  x 

<  •  1  (1) 

Yt  =  —(Xt-Xt)  Y0  =  y. 

<  £ 

If  e  is  very  small,  Yt  is  very  fast  and  one  expects  that  it  will  adjust  rapidly  to  the  current 
value  of  Xt ,  i.e.  Yt  =  Xt  +  0(e)  at  all  times.  Then  the  equation  for  Xt  reduces  to 

Xt  —  +  sin(7r£)  +  cos(\/27r  t).  (2) 

The  solutions  of  (1)  and  (2)  are  compared  in  figure  1. 

Here  is  a  formal  derivation  of  the  limiting  equation  (2)  which  uses  the  backward  Kol¬ 
mogorov  equation.  For  simplicity  we  drop  the  term  sin(7nt)  +  cos(\/2 nt).  Generalizing  the 
derivation  below  with  this  term  included  is  easy  but  requires  a  slightly  different  backward 
equation  because  (2)  is  non-autonomous.  Let  /  be  a  smooth  function  and  consider 

u(x,y,t)  =  f(Xt). 


(This  function  depends  on  both  x  and  y  since  Xt  depends  on  both  these  variable  because 
Xt  and  Yt  are  coupled  in  (1),  and  there  is  no  expectation  since  (1)  is  deterministic.)  The 
backward  equation  is 


du 

di 


—  Lx.u  T  T-yU, 
£ 


where 

r  5  r  f  ^ 

=  L,  =  -(y-x )^. 

Look  for  a  solution  of  the  form  u  =  no  +  £^i  +  0(£2),  so  that  u  — >  uo  as  £  — >  0.  Inserting  this 
expansion  into  the  backward  equation,  and  grouping  terms  of  same  order  in  e,  one  obtains 


LyUQ  —  0, 

du0  (3) 

LyU\  —  — - LxUo, 
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2.5 


Figure  1:  The  solution  of  (1)  when  e  =  0.05  and  we  took  Xq  =  2,  Yq  =  —1.  Xt  is  shown  in 
blue,  and  Y%  in  green.  Also  shown  in  red  is  the  solution  of  the  limiting  equation  (2). 


and  so  on.  The  first  equation  tells  that  uo  belong  to  the  null-space  of  Ly,  i.e.  uo  =  uo(x,t). 
The  second  equation  requires  as  a  solvability  condition  that  the  right  hand-side  belongs  to 
the  range  of  Ly.  To  see  what  this  condition  actually  is,  multiply  the  second  equation  in  (3) 
by  a  test  function  p(y),  and  integrate  both  sides  over  R.  After  integration  by  part  at  the 
left  hand-side,  this  gives 


Llp(y)u}dy 


where  L *  is  the  adjoint  of  Ly  viewed  as  an  operator  in  y  at  fixed  x,  i.e. 

L*yp(y )  =  -  x)p{y)). 

Choosing  p(y )  such  that 

o  =  L*vp{y), 

one  concludes  that  the  solvability  of  (3)  requires  that 

' duo 


0=  Lp{y){^f-LxUo)dy- 


(4) 

(5) 


It  can  be  shown  that  this  equation  is  also  sufficient  for  the  solvability  of  (3)  -  the  calculation 
above  actually  tells  the  range  of  Ly  is  the  space  perpendicular  to  the  null-space  of  the  adjoint 
of  Ly.  Now,  (4)  is  simply  the  forward  Kolmogorov  equation  for  the  equilibrium  density  of 
the  process  Yt  at  fixed  Xt  =  x.  Here  the  equilibrium  density  is  a  generalized  function 


p(y\x)  =  6(y  —  x). 
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Using  this  p(y\x),  the  solvability  condition  (5)  becomes 

du0  ,  du0 

0=^r+:c-&' 

which  is  the  backward  equation  for 

Xt  =  ~Xl  Xo  =  x. 

A  similar  argument  with  the  term  sin(Trt)  +  cos(\/27rt)  included  gives  the  backward  equation 
for  (2). 

2  Generalization  to  stochastic  differential  equation 

The  derivation  that  lead  to  (2)  can  be  generalized  to  SDEs.  Consider 


dXt  =  f(Xt,  Yt)dt, 


X0  =  x 


dYt  =  -b{XuYt)dt  +  -L<r(XttYt)dt,  T0  =  y, 

£  y/£ 


(6) 


and  assume  that  the  equation  for  Y%  at  Xt  =  x  fixed  has  an  equilibrium  density  p(y\x)  for 
every  x.  Then  going  through  a  derivation  as  above  with 

u(x,y,t)  =  E  f(Xt), 

one  concludes  that  the  backward  equation  associated  with  this  SDE  also  reduces  to  (5)  as 
£  — *■  0,  i.e. 


d^o  =  pM^o 

dt  [  )  dx  ’ 


where 


F(x)  =  /  f(x,y)p{y\x)dy. 

Jr 

Thus  the  limiting  equation  for  Xt  is 

Xt  =  F(Xt),  Xo  =  0. 

The  main  difference  with  the  deterministic  example  treated  before  is  that  the  fast  process 
Yt  does  not  rapidly  settle  to  an  equilibrium  point  depending  on  the  current  value  of  Xt  - 
only  its  density  does. 

Here  is  an  example  generalizing  (1).  Consider 


f  dXt  =  —Y^dtt  +  sin(7 xt)  +  cos(\/27 rt),  Xo  =  x 

Y0  =  y. 


dYt  =  -~{Yt  ~  Xt)dt  +  -^=dWu 

e  yje 


(7) 


The  equation  for  Yt  at  fixed  Xt  =  x  defines  an  Ornstein-Uhlenbeck  process  whose  equilib¬ 
rium  density  is 


p{y\x)  = 


e-{y-x)2/a2 

y/na 
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Figure  2:  The  solution  of  (7)  with  Xo  =  2,  Yq  —  —1  when  e  =  10-3  and  a  =  1.  Xt  is  shown 
in  blue,  and  Yt  in  green.  Also  shown  in  red  is  the  solution  of  the  limiting  equation  (8). 
Notice  how  noisy  Yt  is. 


Therefore 

r  g—(y—x)2/Q2 

F(x)  =  -  /  y3— — 7= - dy  =  -a:3  -  %a2x, 

Jr  s/kq 

and  the  limiting  equation  is 

Xt  =  —Xf  —  |o2AT*  +  sin(7r/,)  +  cos(\/27rt),  Afo  =  x.  (8) 

Note  the  new  term  —  |o2A^,  due  to  the  noise  in  (7).  The  solution  of  (7)  and  (8)  are  shown 
in  figure  2. 

3  Strong  convergence  and  the  property  of  self-averaging 

The  derivation  in  section  2  only  give  weak  convergence,  or  convergence  in  distribution.  But 
stronger  results  can  be  obtained.  Consider  a  system  of  the  form 

Xf  =  f(Xf,Yt/e),  (9) 

where  Yt  is  a  given  stochastic  process.  Assume  that  Yt  is  ergodic,  in  the  sense  that  for  any 
fixed  x, 

lim  ^  /  f(x,Ys)d.s  —  f(x). 

7^oo  1  J0 

Then  we  can  show  that.,  as  e  — »  0,  Xf  converges  strongly  to  the  solution  of 


(10) 


To  see  this,  consider  the  integral  form  of  (9): 


/: t+At 

f(Xs,Ys/e)ds. 


(12) 


We  rewrite  this  equation  in  a  way  that  allows  us  to  exploit  the  self- averaging  property  (10). 


/t+m  rt+At 

f(Xt,  Ys/c)ds  +  J  {f(Xs,  Ys/e)  -  f(Xt,  Ys/e))  ds. 

We  will  consider  the  behavior  of  these  two  integrals  as  £  — »  0  separately. 

Using  (10),  the  first  integral 

pt+At  p(t+Ai)/e  __ 

•  /  f(Xu  Ys/e)ds  =  £  /  f(Xu  Ys)ds  -  Atf(Xt), 

Jt  Jt/e 

as  e  — >  0.  To  investigate  the  contribution  of  the  second  integral,  let 

rt+At 


/z-tm 

{f(Xs,Ys/e)-f(Xt,Ys/£))ds. 


We  then  have 


/t+At 

\f(Xs,Ys/£)-f(XuYs/e)\ds. 

Assuming  /  is  uniformly  Lipschitz  in  Yt  with  constant  AT,  we  then  write 

/t+At 

K  \XS  —  Xt\ds 

Xs-Xt-  r f(Xt,Y$l/£)ds' 

J*  f{XuYS'/£)ds' 


pt+Ai 

* 

pt+At 

+l 


It  is  straightforward  to  show  using  (14)  that,  for  sufficiently  small  e, 

rt+At 


/t+m  ps 

K  J  f(Xt,Ys//e)ds'  ds  <  CAt2 


for  some  constant  C  <  oo.  Gronwall’s  lemma  then  implies  that 

,  rt+At 

\Xt+At-Xt-  I  f(XuYs/£)ds  =  \A(t,At,e)\ 

<  CAt2  exp  K At  =  o{At). 

This  shown  that 

Jim  (X*t+At  -  XI)  =  Atf(Xf)  +  o(At). 
which  is  sufficient  to  demonstrate  that  Xf  converges  strongly  to  Xt. 


(13) 


(14) 

(15) 

(16) 


(17) 


(18) 


(19) 
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4  Diffusive  time-scale 


An  interesting  generalization  of  the  situation  presented  in  section  2  arises  when 


f{x,y)p{y\x)dy  =  o. 


(20) 


In  this  case  the  limiting  equation  reduces  to  the  trivial  ODE,  Xt  =  0,  i.e.  no  evolution  at 
all.  In  fact,  the  interesting  evolution  then  occurs  on  a  longer  time-scale  of  order  e'1,  and 
the  right  scaling  to  study  (6)  is 


jdXt  =  -£f(XuYt)dt,  X0  =  x 

I  dYt  =  4 b(Xu  Yt)dt  +  -a(Xt,  Yt)dt,  Y0  =  y, 
v  e*  e 


(21) 


To  obtain  the  limiting  equation  for  Xt  as  £  — >  0,  we  proceed  as  above  and  consider  the 
backward  equation  for  u(x:  y,  t)  =  E f(Xt),  which  is  now  rescaled  as 


du 

at 


Lx ^  ”t*  r\  LyU. 
£  £Z 


Inserting  the  expansion  u  =  uq  +  eu\  +  £2U2  +  0(e2)  (we  will  have  to  go  one  order  in  £ 
higher  than  before)  in  this  equation  now  gives 


LyU0  =  0, 

Ly'u  i  —  Lxu  Oi 

v  du0  T 

Lyv  2  =  -Q- - Lxu\, 


(22) 


and  so  on.  The  first,  equation  tells  that  uo(x,y,t)  —  uo(x,  t).  The  solvability  condition  for 
the  second  equation  is  satisfied  by  assumption  because  of  (20)  and  therefore  this  equation 
can  be  formally  solved  as 

Ui  =  —L~1Lxuq. 

Inserting  this  expression  in  the  third  equation  in  (22)  and  considering  the  solvability  con¬ 
dition  for  this  equation,  we  obtain  the  limiting  equation  for  ^o- 

du0  - 
— —  =  LxUq , 

dt 


where 


Lx=  dyp(y\x)LxLy  lLx. 


To  see  what  this  equation  is  explicitly,  notice  that  —Ly  lg{y )  is  the  steady  state  solution  of 


dv 

dt 


=  Lyv  +  g{y). 


The  solution  of  this  equation  with  the  initial  condition  v(y,  0)  =  0  can  be  represented  by 
Feynman-Kac  formula  as 

v(y,t)  =  E  f  g(Y?)ds, 

■w 
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where  Yx  denotes  the  solution  of  the  second  SDE  in  (21)  at  Xt  =  x  fixed  and  e  =  1,  i.e. 
dYf  =  b(x,  Ytx)dt  +  a(x,  Yx)dWu  Y0X  =  y. 

Therefore 

r  oo 

=  e  / 

JO 

and  the  limiting  backward  equation  above  can  be  written  as 

i!r  =  E l  * v)^(m ), 

This  is  the  backward  equation  of  the  SDE 

dXt  =  b(Xt)dt  +  a(Xt)dWu  X0  =  x, 


where 

6(®)  =E  jf  J^p(y\x)f(x,y)~f(x,Ytx)dydt, 

a2(x)  =  2E  /  f  p(y\x)f(x,y)f(x,Ytx)dydt. 

Jo  Jr 

The  interesting  new  phenomena  is  that  the  limiting  equation  for  Xt  has  become  an  SDE. 
This  means  that  fluctuations  are  important  on  the  long-time  scale  and  give  rise  to  stochastic 
effects  in  the  evolution  of  Xt  that  were  absent  on  the  shorter  time-scale. 

The  calculation  above  is  easy  to  generalize  if  there  is  a  slow  term  in  the  original  equation 
for  Xt,  i.e.  if  instead  of  (21)  one  considers 

I dXt  =  g(Xt,  Yt)dt  +  -£f(Xt,  Yt)dt,  Xo  =  x 

I  dYt  =  4 b(Xu  Yt)dt  +  -a (Xt,  Yt)dt ,  Y0  =  y, 

The  limiting  equation  for  Xt  is  then 

dXt  =  G(Xt)dt  +  b{Xt)dt  +  d(Xt)dWu  X0  =  x, 
with  b(x)  and  a(x)  as  above,  and 

G(x)  =  /  p(y\x)g{x,y)dy. 

Jr 

It  is  also  straightforward  to  generalize  to  higher  dimensions. 

Here  is  an  example. 


2r>  0 

dXt  =  — YtZtdt  -  (Xt  +  X?)d,t, 

q  IT 

<  dYt  =  —ZtXtdt  -  -zYtdt  +  -dW?, 

£  £z  £ 

dZt  =  --hYtXtdt  -  \ztdt  +  - dWf . 

£  £z  £ 
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0.5 


Figure  3:  The  equilibrium  density  p(x )  —  Z  le^a2 
(red). 


i)x2  4 x4  for  a  =  |  (blue)  and  a  =  2 


where  Wty,  Wt2  are  independent  Wiener  processes  and  a  is  a  parameter.  There  are  two 
fast  variables,  Y/  and  Z*,  in  this  example.  There  is  also  a  slow  term,  —  (Xt  +  Xf)dt ,  in 
the  equation  for  Xt  which,  in  the  absence  of  coupling  with  Yt  and  Zu  would  drive  X%  to 
the  position  x  =  0.  We  ask  to  what  extend  this  equilibrium  of  the  uncoupled  dynamics  is 
relevant  with  coupling  with  Yt  and  Zt. 

The  limiting  equation  for  Xt  is 

d.Xt  =  ((a2  -  1  )Xt-  Xf)dt  +  adWt. 

The  equilibrium  density  for  this  equation  is 

p(x)  = 

This  density  is  shown  in  figure  3.  For  |g|  <  1,  p(x)  is  mono-modal  and  centered  around 
x  =  0,  the  stable  equilibrium  of  the  uncoupled  dynamics.  However,  for  |a|  >  1,  p(x)  be¬ 
comes  bi-modal,  with  two  maxima  at  x  =  ±va2  -  1  and  a  minimum  at  x  =  0.  Thus 
coupling  with  the  fast  modes  may  destroy  the  structures  apparent  in  the  uncoupled  dynam¬ 
ics  and  induce  bifurcations. 

Notes  by  Inga.  Koszalka  and  Alex  Hasha. 
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Lecture  9:  The  use  of  variational  methods  for  high-contrast 

conductivity  problems 

George  C.  Papanicolaou 


1  Introduction 

In  this  lecture  we  will  consider  the  conductivity  in  a  high-contrast  medium.  Besides  its 
physical  importance,  the  model  under  consideration  will  serve  as  an  illustration  of  the  use 
of  variational  principles.  This  will  provide  a  good  introduction  to  variational  principles 
before  using  them  in  a  more  difficult  form  in  the  next  lecture,  where  convection-diffusion 
problems  at  high  Peclet  numbers  (strong  convection  versus  weak  diffusion)  are  considered. 

2  General  formulation 

Consider  a  smooth  region  fl  C  R2  with  outward  unit  normal  n(x)  and  with  given  non¬ 
negative  conductivity  a(x)1.  The  governing  equation  for  the  potential  $  is 

V  •  [a(x)V$]  -  0,  x  €  fi,  (1) 

with  Neumann  boundary  condition2 

<r(x)“—  =  J(x),  x  G  dQ.  (2) 

on 

The  outgoing  current  J(x)  is  assumed  to  be  equilibrated,  hence 

[  IdS  =  0.  (3) 

Jon 

Next,  let  us  introduce  s(x)  by  assuming  that  the  conductivity  has  the  form 

a(x)  =  a0e-sW/£,  (4) 

where  we  are  interested  in  the  high-contrast  limit  characterised  by  e  [  0. 

Plugging  (4)  into  (1)  leads  to 


-  Vs  •  V$  =  0.  (5) 

aThe  analysis  will  not  be  affected  qualitatively  in  3  dimensions,  but  for  simplicity  we  will  consider  R2, 

2  Dirich  let  boundary  condition  =  'L  can  be  analysed  as  well,  without  qualitative  changes. 
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Notice  that  the  operator  on  <3>  in  (5)  is  self-adjoint,  as  opposed  to  the  similar  equation  for 
a  divergence- free  fluid,  which  will  be  discussed  in  more  detail  in  the  next  lecture.  In  fact, 
(5)  is  difficult  to  solve  in  general,  hence  in  the  following  section  the  classical  variational 
principles  will  be  introduced  to  help  us  estimate  the  solution  without  solving  the  equation 
itself. 


3  Variational  principles 

To  introduce  the  classical  variational  principles,  we  first  need  to  define  Dirichlet- to- Neumann 
(DtN)  and  Neum ann-to-Di richle t  (NtD)  maps. 

The  DtN  map  A  takes  Dirichlet  boundary  data  to  the  outgoing  current 

I  =  ,  hence  A'P  =  /.  Furthermore,  given  (3)  the  NtD  map  can  be  defined  as  the  inverse 

of  the  DtN  map,  namely,  'k  =  A"1/.  Without  going  into  details,  let  us  note  that  after 
determining  these  two  maps,  one  has  almost  all  the  information  about  the  problem  that 
can  be  observed  at  the  boundary. 


A  is  a  self-adjoint,  positive  semidefinite  map  with  respect  to  the  standard  inner  product. 
Indeed, 


(A'lq  $)  =  /  A {x)ty (x)  dS  =  /  /(x)'I'(x)  dS  =  (using  the  boundary  conditions) 

JdQ  Jon 

a(x)~//X~^(x)  dS 

Jon  dn 


It 90 


<I>(x)<t(x)V$  •  ndS  =  (by  the  divergence  theorem) 


=  /  V-(3 >(x)a(x)V$)dV  =  [  a(x 

Jn  Jn 


)V<I>  •  V<Z>dV  >  0. 


(6) 


which  demonstrates  that  A  is  positive  semidefinite.  In  the  last  step  we  integrated  by  parts 
and  used  (1).  Now  let  \['i  and  ^2  be  two  different  sets  of  Dirichlet  boundary  data.  Using 
(6)  we  see  that 


(Atf'i,  ^2)  =  /  A^j(x)^2 (x)dS  =  /  cr(x)V<3>  1  ■  =  (by  symmetry) 

Jdn  Jn 


on 


A^2(x)^i  (x)dS  =  (^j,A^2) 


(7) 


and  thus  the  map  A  is  self  adjoint. 

Now  we  are  ready  to  introduce  the  Dirichlet  variational  principle (D VP): 

(A'lq  \k)  —  min  |  J  aVl>  ■  V$dV  |  is  square-integrable  and  &\dn  =  «  (8) 

To  prove  the  DVP  (8),  we  consider  the  Euler-Lagrange  equations  for  the  variational  problem 
on  the  right  hand  side.  If  an  integral  K  is  of  the  form 


K  =  if  (*■*'•)  iv'  4.^,  =  ^, 


(9) 
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then  the  corresponding  Euler-Lagrange  equations  for  solving  5K  =  0  by  varying  $  are, 
using  the  summation  convention, 


_d_ 

dxi 


df 

d$Xi 


=  0. 


(10) 


For  equation  (8)  f($,$Xi)  =  cr$Xi$Xi  and  thus  the  Euler-Lagrange  equations  become 

sjO’*-)-0-  (11) 

This  is  simply  our  original  conductivity  equation  (1),  and  thus  the  integral  in  the  DVP 
(8)  is  minimised  when  $  =  $  where  <3>  solves  (1).  The  integral  in  the  DVP  is  called  the 
Dirichlet  integral  and  measures  the  rate  of  energy  dissipation. 


The  DVP  can  be  written  in  another  form,  namely, 

(A'F,  tf)  =  min  {j  aE  ■  EdV  |  E  =  V3>  is  a  curl-free  field  and  $|sn  =  $  j  .  (12) 

This  form  of  the  DVP  helps  to  illustrate  better  the  duality  of  DVP  with  the  Kelvin 
variational  principle (K VP): 

(I,  A"1/)  =  mm  { jf  a~1} -}dV  |  V  •  j  =  0  and  j  •  n|an  =  ij ,  (13) 

where  j  =  aV$  is  the  divergence- free  current.  A  similar  calculation  shows  that  the  mini¬ 
mum  is  realized  by  j  =  aV$,  where  $  is  again  the  solution  of  (1). 

Notice  that  while  we  cannot  solve  the  conductivity  equation  (1)  in  general,  we  know 
beforehand  that  the  solution  must  be  the  minimiser  of  the  functionals  appearing  in  both 
the  DVP  and  KVP.  This  important  feature  allows  us  to  bound  both  A  and  A^1  from  above 
by  taking  appropriately  well-constructed  test  functions  <J>  and  j.  It  can  be  shown  that  this  is 
equivalent  to  finding  both  upper  and  lower  bounds  for  the  map  A.  In  some  problems  these 
bounds  coincide,  giving  rise  to  the  exact  solution.  This  method  is  particularly  well-suited 
for  problems  in  the  high-contrast  limit,  where  we  try  to  find  an  asymptotic  form  for  the 
solution  or  at  least  bound  it  from  both  above  and  below. 


4  The  high-contrast  conductivity  problem 

In  this  part  of  the  lecture,  we  consider  a  particular  problem  which  will  serve  as  a  bench¬ 
mark  to  illustrate  the  application  of  the  variational  principles  in  the  conductivity-related 
problems. 

Consider  the  conductivity  equation  (1)  with  $(x, y)  =  x(xiV)  + 

V-k(Vx  +  e1)]  =  0,  (14) 
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where  for  simplicity  we  have  dropped  the  arguments  (x,  y).  Here  ex  is  the  unit  vector  along 
the  .r-axis.  Let  the  domain  be  the  square  region  D  —  [—1/2, 1/2]  x  [—1/2, 1/2].  Consider 
solutions  with  periodic  potential  x(x,y),  which  must  be  unique  up  to  an  additive  constant. 
We  will  be  interested  in  the  quantity 

a*(ei)  =  MVx  +  eO-ei),  (15) 

where  the  average  (  )  is  taken  over  D.  cr*(ei)  can  be  interpreted  as  the  avemge  flux  per 
unit  average  gradient  in  the  direction  ei. 

Suppose  cr(x,y)  satisfies  a(x,y)  =  with  the  high-contrast  assumption  0  < 

e  1.  Then  we  can  write  cr*(ei)  in  the  following  form  (justified  by  an  integration  by  parts 
and  (15)): 


<r*(ei)  =  (cr(V*  +  ei) .  (Vx  +  ei))  =  /  cr(Vx  +  ei)  •  (Vx  +  ea)  dixdy.  (16) 

Jd 

Furthermore,  using  the  Dirichlet  variational  principle, 

a*{ei)  =  min  [  a0e~s/c(\7x  +  ei)  •  (Vx  +  ex)dxdy.  (17) 

X  JD 

We  will  now  consider  the  case  where  there  is  a  single  saddle  point  in  our  domain  (Figure 
1).  The  integral  in  (17)  cannot  be  tackled  by  Laplace’s  method,  since  x  itself  depends  on 
the  infinitesimal  parameter  e.  In  fact,  the  major  contribution  in  the  integral  comes  from 
the  neighbourhood  of  the  saddle  point,  which,  without  loss  of  generality,  can  be  assumed 
to  be  at  the  origin  with  principal  axes  aligned  with  the  coordinate  axes.  Since  the  gradient 
of  the  function  at  a  saddle  point  vanishes,  we  have  the  following  Taylor  expansion  of  s(x,y) 
up  to  second  order: 

s(x,  y)  «  So  -  y  <r2  +  y  y2,  (18) 

where  Aq  and  k,2  are  the  principal  curvatures  of  the  level  curves  of  5  intersecting  at  the 
saddle  point. 

Next,  we  pass  into  an  approximate  inequality  by  shrinking  the  integration  region  to 
A  =  [—5,5]  x  [—5,5]  and  plugging  the  truncated  expansion  (18)  into  (17),  as  well  as  by 
minimising  only  among  the  functions  x{x:V)  —  x(*T): 

<7*(ei)  ^  min  /  ao exp  [s0  -  yx2  +  yy2])  {Xx  +  ifdxdy 

«  aoe~so/\flfl  min  [*  e^x\xx  +  l)2  dx.  (19) 

V  «2  x(x)  J_i 5 

By  DVP,  the  solution  of  (14)  is  the  minimiser  of  the  functional  above,  hence  we  will  look 
for  x(x)  satisfying  the  equation 


(e2< x  (xx  +  l))x  —  0, 


(20) 
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Figure  1:  Topography  of  s(x,  y )  in  the  square  region  D.  At  the  origin  there  is  a  saddle  point. 
The  filled  line  through  the  origin  is  a  valley,  the  dashed  line  a  ridge.  The  local  maximum 
is  a  filled  circle,  the  local  minima  a  hollow  circle.  The  analysis  focuses  on  a  small  region  A 
near  the  saddle  point. 


as  well  as  the  periodicity  condition  for  Xi  which  implies 

<X*>  =  0.  (21) 

Since  e  is  very  small,  the  average  in  (21)  can  be  taken  over  the  neighbourhood  of  the  saddle 
point  [—5,5]  as  well  as  over  the  whole  interval  [—1/2, 1/2]  without  changing  the  leading 
order  asymptotic  term. 

Solving  (20),  one  finds 

Xx  +  l  =  Ce-^x\  (22) 

while  the  constant  C  can  be  found  from  (21): 

l  =  (Xx  +  l )  =  (Ce-fe*2)  ~  (23) 
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Hence  the  optimising  function  satisfies  the  approximate  equation,  which  is  asymptotic  to 
the  solution  of  (14)  as  e  — ►  0 


Xx  +  1  ~ 


2c  ' 


Plugging  into  the  integral  (19)  leads  to 


cr*(ei)  °oe  So/e 


e  2cx 


27Tt^ 
k  i 


(24) 


(25) 


which  is  the  conductivity  at  the  saddle  point  aoe  multiplied  by  the  factor  y  deter¬ 
mined  by  the  curvatures  of  the  level  sets  passing  through  that  saddle  point.  For  instance, 
small  k‘2  corresponds  to  a  narrow  saddle  point,  where  the  conductivity  is  large. 


Using  KVP  for  the  backward  NtD  map,  one  can  find  a  lower  asymptotic  bound  for 
cr*(ei)  which  turns  out  to  be  exactly  the  same  as  in  (25)!  This  leads  to  the  exact  asymptotic 
expression  for  the  average  resulting  flux  in  the  x-direction 


cr*(ei) 


as  e 


0. 


The  corresponding  resistance  p*  =  1  /a*  is  given  by 


P*(ei)  ~ 


as  e 


0. 


(26) 


(27) 


5  Complicated  topography 

Wo  now  consider  the  situation  where  we  have  multiple  saddle  points  in  our  domain.  Figure 
2  gives  an  example  of  such  a  situation. 

To  understand  how  current  flows  through  the  domain  in  Figure  2  it  is  useful  to  make  an 
analogy  with  the  flow  of  water.  Consider  the  case  where  current  flows  into  the  domain  over 
a.  It  will  flow  directly  to  the  nearest  point  of  maximum  conductivity,  node  1.  There  current 
will  “poor’  before  escaping  through  the  “channels”  (saddle  points)  to  the  adjacent  nodes. 
From  these  nodes  current  will  then  flow  either  to  other  nodes  via  the  channels,  or  out  of 
the  boundary.  Hence  intuitively  the  domain  can  be  thought  of  as  behaving  as  a  network  of 
channels. 

More  formally,  we  have  that  the  dominant  contribution  to  the  DtN  map  A  as  e  j  0 
is  determined  by  the  saddle  points  of  s(x,  y).  At  each  saddle  point  we  can  calculate  the 
resistance  of  the  saddle  using  the  result  for  a  single  saddle  (27).  Denote  the  resistance  of 
the  saddle  point  between  node  i  and  node  j  as  R^j.  Note  that  R+j  is  symmetric:  R.tj  =  Rji. 
Since  each  saddle  can  be  considered  as  a  single  resistor,  we  can  reduce  the  problem  to  a 
simple  resistor  network  (Figure  3). 
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Figure  2:  An  example  domain  with  multiple  saddle  points.  The  topography  of  $(ar,  y)  is 
shown.  The  filled  circles  are  local  maxima  of  conductivity,  and  are  numbered  1,  2,  3,  4 
(we  will  refer  to  these  as  the  nodes).  The  lines  dividing  the  domain  are  the  valleys.  The 
corresponding  divided  parts  of  the  boundary  are  labelled  a,  6,  c,  d.  The  saddle  points  are 
shown  as  two  short  parallel  lines,  resembling  channels.  The  resistance  Rij  has  been  labelled 
by  each  saddle  (see  later  discussion). 


The  DtN  map  A  of  the  full  problem  is  asymptotic  as  e  j  0  to  the  DtN  map  of  the 
resistor  network.  Consider  the  Dirichlet  problem  where  the  potential  $  is  specified  on  the 
boundary,  Then  equation  (8)  becomes 

(Att,\P)=  min  [  V&dV  <  min  V  (28) 

*•*1  ea=9Jn  dn=^kj€nodeskei/.Rjk^  ' 

The  above  expression  specifies  an  asymptotic  upper  bound  for  the  DtN  map  A.  Here  the 
set  Vj  is  the  set  of  nodes  adjacent  to  the  node  j,  and  is  the  potential  at  node  j .  are 
the  integrated  potentials  specified  on  the  sections  of  boundaries  k.  The  boundary  condition 
is  now  that  the  potentials  of  nodes  adjacent  to  the  boundary  are  equal  to  the  potentials 
VE*/,  on  the  boundaries.  For  the  example  domain,  the  boundary  condition  becomes  $1  =  Ta, 
$2  =  Tfc,  $3  =  \l/c,  and  $4  =  In  this  simple  case  it  means  that  all  have  been 
determined,  but  in  more  complicated  cases  there  can  be  $/,  in  the  interior  of  the  domain 
which  are  not  directly  specified  by  the  boundary  condition.  Even  in  these  more  complicated 
cases,  the  minimisation  is  now  just  an  easy  to  solve  matrix  problem. 

Similarly  we  can  solve  the  dual  problem  (13)  where  the  current  j  is  specified  on  the 
boundary  rather  than  the  potential  The  dual  problem  yields  a  corresponding  asymp¬ 
totic  upper  bound  for  the  inverse  map  A-1,  and  thus  an  asymptotic  lower  bound  for  A.  As 
in  the  single  saddle  case  it  turns  out  that  the  asymptotic  lower  bound  for  A  is  the  same  as 
the  asymptotic  upper  bound,  and  thus  we  get  an  asymptotic  equality. 

Notes  by  Khachik  Sargsyan  and  John  Rudge . 
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Lecture  10:  Convection  Diffusion  Problems 


George  C.  Papanicolaou 


1  2-D  Convection-Diffusion 

Consider  a  2-D  divergence-free,  periodic,  steady  flow  field  u(t,  x )  in  a  domain  without  any 
boundaries.  Let  p{t)  x )  be  the  concentration  of  a  passive  scalar,  say  temperature.  Then  the 
non-dimensional  governing  equations  for  the  non-dimensional  variables  p  and  u  are: 

pt  +  u  *  Vp  =  eAp  ,  (1) 

V  •  u  =  0  ,  (2) 

together  with  the  initial  condition  p(0,  x)  —  p°(x).  Note  that  e  is  dimensionless  parameter 
since  e”1  ~  ULjv  =  Pe,  where  Pe  is  the  Peclet  number  and  L  is  the  size  of  the  peri¬ 
odic  cell.  By  integrating  (1)  over  fR2  and  using  (2),  we  see  that  if  f^2  p°(x)dx  =  1,  then 

2  p(t ,  x)dx  =  1.  Also  if  p°(x)  >  0,  then  p(t,  x)  >  0.  Since  V  *  u  =  0  and  the  flow  is  2-D, 

it  is  possible  to  introduce  a  stream  function  x/;(x): 

U  =  (— 1>x)  *  (3) 

If  y)  =  sin x sin  y  +  5 cos x  cosy,  then  we  have  a  cellular  flow  if  6  =  0,  and  a  shear  flow 
if  S  =  1.  Since  x(t)  is  the  position  of  a  diffusing  particle,  the  evolution  equation  for  x(t) 
can  be  written  as  the  following  SDE: 

dx(t)  =  u(x(t))dt  +  V2e  dW(t).  (4) 

If  there  is  no  diffusion  (i.e.  there  is  no  y/2e  dW(t )  term  in  (4)),  a  particle  starting  on  a 
particular  streamline  remains  on  the  streamline.  If  we  have  diffusion,  there  is  a  possibility 
for  a  particle  which  starts  in  the  region  (a)  to  move  to  the  region  ( b )  (See  Figure  1).  In 
that  case,  p  can  be  interpreted  as  the  probability  density  of  x(t). 
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Figure  1:  Rough  sketch  of  the  periodic  cell 


2  Effective  diffusivities 


Consider  a  diffusing  particle,  lim^-.^  |E  [{(x(t)  —  a;(0))  •  e}2]  always  exists  for  u(x )  which 
is  periodic  and  satisfies  V  •  u  =  0  and  (u(x))  =  0,  where  (•)  represents  the  periodic  cell 
average.  We  denote  this  limit  as  cr*(e),  so  called  the  effective  diffusivity .  It  is  a  quadratic 
form  of  e. 

We  now  take  the  large  time,  long  distance  limit  of  the  PDE  (1)  by  changing  the  variables 
t  — *  n?t ,  x  — >  nx  and  letting  n  — ►  oo.  (This  process  is  called  the  homogenization .)  pri(t,  x)  = 
p(??2/,,  nx)  converges  to  p(/.,  x)  in  an  appropriate  sense  as  n  — >  oo,  where  p(£,  x)  is  the 
solution  of  the  homogenized  equation 


Pt  =  V  *  «Vp)  , 

with  p(0,  x)  =  p°(x).  cr*(e)  is  a  constant  matrix,  or  more  precisely 

<(e)  =  <(e/  +  *)(Vx  +  e)  •  e)  , 

where  I  is  the  identity  matrix,  x(x)  is  a  periodic  function  in  fH2,  and 


I'(.t) 


0 

V  V’f®,  y) 

It  is  found  that  cr*(e)  satisfies  the  polarization  relation 


(5) 

(6) 

(7) 


c  hj 


=  4^ 


e,  +e,-  -  crf  (Ci 


/)]  )  ®i  j  —  1)  2, 


(8) 


where  e\  =  (1,  0),  e2  =  (0,  1).  Apart  from  the  homogenized  equation  (5),  the  homogeniza¬ 
tion  process  also  yields  the  cell  problem,  that  is 


V-[(e/  +  *(.T))(VX  +  e)]=0. 


(9) 
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a*  can  be  calculated  by  solving  (9)  for  x  and  plugging  it  into  (6).  The  full  derivation  of 
(5),  (6)  and  (9)  will  be  shown  in  the  next  section.  The  physical  interpretation  of  cr*(e)  is 
the  average  flux  in  the  direction  e  when  there  is  a  unit  average  gradient  in  the  direction  e. 


3  Asymptotics  for  pn(t,x ) 

Recall  the  passive  scalar  advection  equation  in  the  fast  variables 

^  =  V-([I+*n(x)]Vpn)  (10) 

with  initial  condition 


Pn(  0,X)  =  p°(x) 

where  I  is  the  identity  matrix,  \I/n  was  defined  previously,  and  we  have  set  e  =  1.  First  we 
must  check  that  (10)  solves  (1) 


—  ( Pn)xx  “t~  ( Pn)yy  ('0n)x(^/;n)y  "I”  ('0n)y('0n)a:  '*Pn(pn)xy  T  ^Pn{Pn)yx 

~  £^Pn  U  *  Vpn. 

Next  we  expand  pn  in  an  asymptotic  series 

pn(t,  x)  =  p(t,  x)  +  - p W  (t,  x,  nx)  +  \p^  (t,  X,  nx)  +  ... 
n  nz 

It  is  clear  that  for  this  problem  we  have  a  clean  separation  of  scales.  The  fast  time  scale 
does  not  appear  because  the  coefficients  are  time  homogeneous. 


Let  nx  =  £  so  that  V  — +  V*  +  nV^.  Plugging  pn  into  (10)  we  get 


o(  2) 


+  -  = 


|(p+V>+ v 

at  \  n  nz 

(Vx  +  nVc)-  (7  +  ^n(C)  '  (V*  +  «Vf)  (p  +  lp(1)  +  ~2p^  +  •••)  • 

As  is  standard  procedure,  we  equate  the  coefficients  for  powers  of  n.  At  0(n2): 

V€-[(/  +  *n(O)V€p]  =  0. 

Note  (11)  is  automatically  satisfied  since  p  is  not  a  function  of  £.  At  0(n): 

•  [(7  +  *n(0)  vxp]  +  vx  •  [(/  +  *„(0)  vcp]  +  vc  •  [(/  +  *„(0)  vcp(1)]  =  o. 


(ii) 


(12) 
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The  second  term  in  (12)  is  zero  via  (11).  Upon  rewriting  (12)  we  get 


•  [(/  +  *„(0)  (v^(1)  +  v^)]  =  0  (13) 

which  resembles  the  cell  problem  (9).  Equation  (13)  is  a  PDE  for  p^(£)  (periodic  in  £). 
We  can  cast  (13)  into  the  cell  problem  by  letting 


which  separates  the  £  dependence  from  the  t,x  dependence.  The  function  Xe(0  satisfies 

V{-[(/  +  *„(0)(VCXe(  O  +  e)]  =  0.  (14) 

At  0(1): 

=  v5  •  [(/  +  *„(0)  +  v£  •  [(/  +  *„(*))  Vxp(1)]  + 

v*  -  [(/  +  ®„(e))  vcp(,)]  +  v,  •  [(/ + *„(0)  wp] 

which  is  a  PDE  for  p(2)(£)  (periodic  in  £)  with  x  as  parameters.  This  can  be  re-written 
as 


v5  ■  [(/  +  *„(0)  Vcp(2)]  +5  =  0 

(15) 

where 

5  =  V{  •  [(/  +  *„(0)  Vxp(1)j  +  Vx  •  [(/  +  ¥n(0)  V£pW]  +  Vx 

•l(nU))Vx/»]-|. 

Upon  taking  the  c:ell  average  of  (15),  we  obtain 

(v,-  [(/  +  *„(£))  V£A|)  + (5)  =0 

(16) 

and  since  is  a  gradient  of  a  periodic  function,  (S)  =  0  which  yields 

=  ( v£  •  [(/  +  ¥„(0)  V*p(1)]  )  +  (vx  •  [(/  +  *„(£))  V£p(1)] )  + 
(V,  •  [(/  +  tf„(0)  vxp]) 

=  vx-  [((/  +  *„( 0)  (v^w  +  v*?))' 
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since  Vxp^  is  the  gradient  of  a  periodic  function.  In  component  form 


where  Aij  =  Iij  +  (£)•  Thus,  we  obtain  the  homogenized  equation 

dp  _  *  d2p 

dt  °lk  dxidxk 

i,k 

with 


|  =  V.KV,). 

In  summary,  the  key  ideas  for  homogenization  are: 


(17) 


(18) 


1)  Perform  a  multiscale  expansion 

t)  x  ~  macroscopic  scales  (slow) 
n2t,  nx  ~  microscopic  scales  (fast) 

the  resulting  PDE  will  involve  both  fast  and  slow  variables.  In  our  case  'ip  — >  'ip( nx ).  In 
general  ip  — >  ^(nH^nx^t^x). 


2)  Seek  an  expansion  in  which  the  principle  term  is  slowly  varying  (t,x). 

3)  The  coefficients  of  the  slowly  varying  equation  come  from  a  cell  problem.  In  this  case 
the  term  of  interest  was  p  and  we  had  to  go  to  0(1)  to  get  the  cell  problem. 
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The  effective  diffusivity  matrix,  a*  is  given  by 


<7;  =  ((£/  +  *)(Vx).e> 

=  ((el  +  V)  (Vx  +  e)  •  (Vx  +  e)) 

where  we  added  Vx  to  e  because  V  •  [(el  +  'I')  (Vx  +  e)]  =  0  (9).  Also  since  (Vx  +  e)  • 
(Vx  +  e)  is  a  quadratic  form  and  VP  is  skew  symmetric,  we  obtain 

a*(e)  =  e(|Vx  +  e|2) 

=  c  +  e(|Vx|2). 

From  tiffs  it  is  clear  that  convection  always  enhances  diffusion  since  cr*(e)  >  e. 


Finally  we  check  convergence  of  the  asymptotic  expansion 


i) 


max  \pn(i..x)  —  p(x, t)|  <  max 
0<t<T,3?€9f2  o  <t<7>es?2 


pi»  +  o(  - 

\n 


<  Ct- 


provided  po  decays  rapidly  at  infinity  and  is  smooth. 


2) 


Vp)  6(t,  x)dxdt 


0 


where  9  is  a  test  function.  This  says  that  on  average  the  gradient  converges.  Calculating 
V pn  we  obtain 


Vp„  =  Vp+vcp(1)(/,  .7;,  nx)  +  ... 


3) 


sup  / 
o<i<r 


2  1 

dx  <  C7"— 
n 


Vp„-  (vp  +  v€p(J)) 

thus  p^)  closes  the  problem  and  allows  us  to  determine  Vpn.  Note  that  3)  implies  2). 


Notes  by  Tiffany  A.  Shaw  and.  Ay  a  Ta.na.be. 
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Lecture  11:  Analysis  of  the  Childress  cell  problem  and 
stability  of  cellular  flows 

George  C.  Papanicolaou 


1  Introduction 

In  the  first  part  of  this  lecture  we  will  discuss  the  Childress  analysis  of  the  cell  flow  problem 
and  apply  to  it  variational  methods.  Furtheremore  the  general  case  of  coupled  Childress 
cells  will  be  briefly  analyzed.  In  the  second  part  of  the  lecture  we  will  discuss  the  stability 
of  2D  cell  flows  for  the  forced  Navier-Stokes  equation. 


2  Childress  analysis  of  of  the  advection-diffusion  problem  for 
a  simple  cell  flow 


In  the  previous  lecture  we  have  seen  that  the  multiscale  analysis  gives  us  the  following  large 
scale  equation 

V[(e/  +  *)(Vx  +  e)]  =  0,  (1) 

where  I  is  the  unit  matrix  and  ^  is  the  matrix  given  by 


$(£,  y) 


(  0  -ip(x,y)\ 

y )  0.  J 


(2) 


The  effective  diffusivity  is  equal  to 


c*(e)  =  ((eJ  +  *)(VX  +  e)-e)  =  a*£(e)  =  ((e/  +  ^)(Vx+e)-(Vx  +  e)  =  €  +  e{Vx-Vx).  (3) 

The  background  flow  is  assumed  to  be  given  by  a  very  simple  velocity  field  u  =  (— dy'ip,  dtp) 
with  stream  function 

y )  =  sin  x  sin  y,  (4) 

which  is  represented  on  Figure  1.  Note  that  due  to  the  symmetries  of  the  flow  (4)  it  is 
sufficient  to  consider  a  quarter  of  the  original  cell  [0,  2tt]  x  [0,27t].  Indeed,  a  fluid  particle 
which  is  initially  contained  in  the  cell  [0,  tv]  x  [0,  tv]  will  stay  in  this  cell  for  all  times,  see 
Figure  1. 

Equation  (1)  can  be  also  written  as 


eAx  +  u  *  Vx  +  e  u  =  0. 


(5) 
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Figure  1:  Simple  cellular  flow  with  stream  function  ^>(x,  y)  =  sin  x  sin  y  within  the  cell 
[0,  27r]  x  [0,  27r]  (a)  and  the  quarter  cell  [0, 7r]  x  [0,7r]  (b). 


Chose  e  —  (1,0)  and  let  p  =  y  +  x.  Then  we  obtain  the  equation 

cAp  +  u  •  Vp  =  0.  (6) 

The  boundary  conditions  are  specified  as  follows 

P(0,  y)  =  0  p(TT,y)  =  0 ;  (7) 

j/(*>°)  =  °  ^P(x,n)  =  0.  (8) 

The  effective  diffusivity  can  be  calculated  as  follows 

<  =  4  r  r  (Vpfdxdy,  (9) 

**  Jo  Jo 

where  cr*(ej)  =  cr*.  To  calculate  the  effective  diffusivity  (9)  boundary  layer  theory  can  be 
applied  [1].  On  dimensional  grounds  the  thickness  of  the  boundary  layer  is  expected  to  be 
of  order  y/e.  Indeed,  the  boundary  layer  can  be  estimated  by  equating  the  convection  time 
scale  tConv  ~  L/Uo  and  and  the  diffusion  time  scale  tdiff  ~  /2  jv  (Uo  is  the  characteristic 
velocity  and  v  is  the  viscosity).  The  quantity  l/L  is  the  width  of  the  boundary  layer. 
Equating  tconv  and  t<\in  we  obtain  ^  ~  ”  and  ~  (l)2-  Since  e  =  ~  (-£)2  it  follows 

that  the  width  of  the  boundary  laye  is  given  by  ~  yfe.  The  same  arguments  apply  to  the 
case  of  more  general  periodic  flows  (discussed  in  [2])  such  as  the  one  given  by  the  stream 
function 

^(x,  y)  —  sin  x  sin  y  +  5  cos  x  cos  y,  (10) 

see  Figure  2. 


Figure  2:  Cellular  flows  with  stream  function  ^(x,  y )  —  sin  x  sin  y  +  S  cos  x  cos  y  for  different 
<5’s. 


Cellular  flows  connected  with  each  other  by  slight  random  deformations  of  saddle  points 
have  been  considered  by  Isichenko  in  [3].  The  deformation  is  assumed  to  be  of  the  form 

'ip(x.y)  =  sin  x  sin  y  +  6^(x,y),  (11) 

where  the  parameter  6  is  assumed  to  be  small  and  the  function  is  random  with  certain 
properties  [4]. 

In  the  periodic  case  the  efflctive  diffusivity  can  be  estimated  by  using  the  fact  that  p 
changes  significantly  only  in  the  boundary  layer.  Therefore  Vp  is  of  the  order  of  1/y/e  and 

°*  ~ €  (tO 

More  precise  results  can  be  obtained  by  using  the  boundary  layer  method.  We  introduce 
boundary  coordinates 


(x,y) — ►  (iM),  0  <ip<l,  — 4  <  0  <  4. 


(12) 


Note  that  ^  is  just  the  value  of  the  stream  function  which  is  equal  to  zero  on  the  boundary. 
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Furthermore  ,  the  level  lines  of  ip  and  8  are  orthogonal 


V-0  •  V0  =  0,  \Vip\  -  |V0|  on  ip  =  0.  (13) 

It  is  suitable  to  rescale  the  coordinate  t/;  in  the  neighborhood  of  the  boundary.  We  define 


The  standard  chain  rule  yields 


and 


(M)= 


<9p  9//,  dp  <9#  dp 
dz  9.x  d/i  +  dx  d8 


Pxx  —  hxxPx  T  hxPhh  T  ‘2h'x@xPk,6  T  8XXP0  T  @xP00’ 

For  6<  1  the  left  hand  side  of  (6)  can  be  written  as 

e(pxx  +  pyy)  ->  -4=A ij}ph  +  IV^lVhh  +  f +  e(V6fpeo- 
ve 

Form  the  condition  (13)  follows 

—0yPz  +  ^xPy  =  •  VOpo  =  |V^>| 2pe  +  h.O.t. 

The  boundary  layer  equation  has  the  form 


(14) 


Phh  +  Po  =  0,  h>  0,  — 4  <  0  <  4  (15) 

with  boundary  conditions 

p(O,0)  =  O,  for  0  <  0  <  2  (16) 

p(0, 8)  =  7r,  for  —  4  <  8  <  —2  (17) 

f£  =  0,  for  —  2  <  6  <  0  and  2  <  04.  (18) 

Finally,  we  obtain  the  Childress  equation 

i  i  ro o  r4 

—Fa*  -»  —  /  /  Pndhd.0.  (19) 

V6  ^  JO  J —4 

This  problem  has  been  treated  by  A.  Soward  in  [5]. 

Finally,  let  us  remark  that  boundary  layer  coordinates  can  be  used  to  give  an  estimation 
of  the  scaling  of  a*  in  the  case  of  random  flows.  As  before  we  suppose  that  the  boundary  is 
given  by  the  level  set  =  0.  However,  due  to  the  randomness  of  the  flow  this  boundary  has 
a  complicated  fractal  structure.  For  small  V;  let  the  characteristic  velocity  at  ip  be  denoted 
as  U (i/j)  and  the  width  of  the  boundary  layer  by  1(i/j).  Just  as  in  the  case  of  periodic  flows  we 
equate  the  diffusion  time  scale  to  the  convection  time  scale  U2(i/j)/e  ~  l{ip)/Uo-  Percolation 
methods  can  be  applied  to  calculate  the  width  of  the  boundary  layer  in  dependence  on  ip 
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[4].  This  gives  l{ip)  ~  ip  7/4.  Since  the  velocity  is  assumed  to  be  smooth  it  follows  that 
U(ip)  ~  ip  and  U0  =  0(1).  Therefore 

ip  ~  e4/15.  (20) 

Supposing  that  the  gradient  of  p  is  of  order  ^  we  obtain  from  (9) 

(2i) 

where  is  the  area  of  the  boundary  layer.  Therefore,  for  random  flows  the  effective 

3 

diffusivity  scales  like  eis. 

3  Variational  analysis 


The  discussion  of  variational  method  results  in  this  section  is  largely  based  on  [2].  Denoting 
E* J  =  VX  +  equation  (1)  becomes 


V-(I  +  *)£+=0 

(22) 

and  satisfies  conditions  V  x  U+  =  0  and  (E+ )  =  0.  We  also  consider  the  adjoint 

problem 

V- (I -*)£"  =  0,  (23) 

with  V  x  Ee2  = 

0  and  (Ee 2)  =  0.  For  convenience  of  notation  define 

K  =  (i + *)£& ,  =  (/  -  *)£*  • 

(24) 

Then  the  effective  diffusivity  becomes 

<y*(ci,c2)  =  (T>+  -e2). 

(25) 

Define  now 

E'n  =  \{EX  -  £")  D'12  =  I(D+  -  D-  ), 

(26) 

£12  =  ^  +  JS" )  A2  =  ^(£>+  +  D-), 

(27) 

(28) 

It  follows  that 

D[2  =  E'12  +  VE12,  S7D'12  =  0,  V  x  E'12  =  0 

Di2  =  En  +  V  •  D\2  =  0,  V  X  E\2  =  0. 

(29) 

(30) 

The  effective  diffusivity  can  be  written  as 

<7*  =  <0+  ■  e2>  =  {1(0+  ■  e2)  +  1(0-  •  e,)  =  1(0+  •  E~)  +  1(0"  •  E+)  = 
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\((D+  +  D~2)(E+  +  E~))  -  \((Dl  -  D~2)(E+  -  E~))  =  {£>12  •  E12)  -  (D[2  •  E'n). 
Then  we  obtain  the  following  matrix  equation 


* 


a 


(31) 


Note  that  the  matrix 


-I 

*  P 

Effective  diffusivity  can  be  computed  as  solution  of  the  following  variational  problem 


is  symmetric  but  indefinite. 


aost(eue2)  = 


inf 


sup 


(F)  =  ,  V x F= 0  (F')=- ,VxP= 0 

where  the  matrix  A(F,Ff)  is  given  by 


{A(F,F')} 


(32) 


A(F,F')  =  ( 


I  V\  / F' 


$  I 


F 


F' 

F 


The  algebraic  technique  which  underlies  this  calculation  is  that  of  a  partial  Legendre  trans¬ 
form. 

We  will  now  give  upper  and  lower  bounds.  First  analyze  the  supremum.  Consider  the 
equation 

VF'  +  V  •  i'i'F)  =  0  (33) 

with 

F'  ==  -  ~62  -  r'I'F,  (34) 

where  TV  A"1  V  is  the  projection  operator  on  the  space  of  divergence- free  vector  fields.  It  is 
easily  verified  that  (34)  gives  (33).  Now  we  plug  Fl  into  the  expression  (32)  setting  e\  —  e?. 
Then  we  obtain  the  following  upper  bound  for  the  effective  diffusivity 

°* (e)  =  inf  x  {e(F-F)  +  hr*F-T*F).  (35) 

VxF=0,{F)=e  e 


Choose  F  =  V/  with  /  =  f(h,6).  Then  the  first  term  (F  •  F)  in  (35)  gives 


|V/|2  =  I  V/i 


2 


(36) 


where  we  have  used  (13).  Since  the  second  term  in  (36)  is  of  order  e  in  comparison  to  the 
first  term  we  obtain 


e(F  ■  F)  ~  4 

7 


2 

dhdd . 


(37) 


Here  we  have  used  the  fact  that  near  the  boundary  J(h,9 )  ~ 

To  calculate  the  second  term  in  (35)  suppose  that  iT’FV/  =  V/'  so  that  f  is  the 
solution  of  the  Poission  equation 


f  A/'  =  ■  V/. 


(38) 
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The  second  term  of  (35)  becomes  now 

I(T®V/  •  rw/)  =  e(V/'  •  V/').  (39) 

To  obtain  f  up  to  the  leading  order  in  e  it  suffices  to  replace  equation  (38)  by 


d7f  df 
dh 2  ~  dO  ’ 


(40) 


where  we  have  again  used  the  fact  that  J(/i,  6)  ~  y/e\ Vh\2  near  the  boundary.  Solving  (40) 
by  direct  integration  we  can  calculate  the  left  hand  side  in  (39)  in  the  same  way  as  we  have 
done  for  e(V/  ♦  V/).  This  gives 


^(r^v/-r^v/)  ~ 


2 

dhd,e . 


Finally  we  obtain  the  following  inequality 


lirn-We(e)  £  A  inf 

C  ►O  y/e  7TZ  / 


(41) 


(42) 


In  a  similar  a  lower  bound  can  be  given.  Note  that  Childress  problem  appears  in  both  lower 
and  upper  bounds  bounds  and  represents  therefore  an  asymptotic  relation  [2]. 


4  Coupled  Childress  problems 


In  each  cell  we  have  different  functions  fi(hi,6)  and  the  following  system  of  Childress  equa¬ 
tions 

+  h>0,  »e[0,y.  (43) 

We  have  to  impose  the  following  boundary  conditions:  /hedges  =  fik{6)>  where  k  is  one  of 
the  edges. 

For  common  interior  edges  we  have  the  conditions 


dfi 

dh 


,  9fj  |  _  n 

+  'dhlh=°-0- 


(44) 


This  allows  us  to  construct  a  network  approximation  for  convection-diffusion  problems  with 
many  cells,  see  Figure  3. 


5  The  Stability  of  Cellular  Flows 

Let  us  consider  the  two  dimensional  Navier-Stokes  equations  driven  by  a  spatially  periodic 
force  F(y ): 


UT  +  (U  •  V)U  =  -Vp  +  ^AU  +  F 

v-u  =  o 
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Figure  3:  Network  approximation  for  problem  of  many  coupled  cells. 


where  the  Reynolds  number  Re  =  ^  is  based  on  a  length  scale  L  which  is  proportional  to 
the  period  of  the  forcing,  taken  to  be  equal  to  27r. 

As  we  are  in  two  dimensions,  the  incompressibility  condition  V  •  U  =  0  implies  that 
there  exists  a  stream  function  <I>  so  that  U  =  (~^). 

Writing  the  Navier- Stokes  equations  above  in  terms  of  the  stream  function  $  we  have: 

|A4>  +  Jw($,  A*)  =  ^A24>  +  /  (45) 

where  Jyy(u,  v)  =  —U2V\  -\-U\v 2-  Here  /  =  +  i*2,i  is  27t  periodic  in  R 2.  It  is  chosen  so 

that  it  gives  rise  to  a  stream  function  (f>  which  is  a  time  independent,  mean-zero,  periodic 
solution  of  the  Navier-Stokes  equations: 

JyyWv ),  A <j>(y))  =  -^-A 2(j)(y)  +  f(y)  (46) 

Let  $(r,  y)  —  <f>(y)  +  $(r,  y)  be  a  perturbation  of  the  stationary  solution  <f>(y).  If  the 
stream  function  of  the  basic  flow  is  an  eddy  of  size  2  ?  that  is  if  (f){y )  is  an  eigenfunction 
of  the  Laplacian 


then  the  driving  force  f(y)  is 


A(f)  =  —  kcj), 

f(y )  =  ~ir<t>(y) 


(47) 
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and  #(r,  y)  satisfies: 


drA$(T,y)  +  Jyy(<j)(y),(k  +  A)$(r,y))  +  Jw($(T,y),  A$(r,y))  =  -^A2l>(T,y)  (48) 

What  concerns  us  here  is  the  stability  of  eddy  flows  like  (46)  and  (47)  subject  to  an 
initial  modulational  perturbation,  a  perturbation  on  a  scale  much  larger  than  that  of  the 
eddy  (see  Dubrulle  and  Frisch ([8]  for  references  about  previous  works  in  this  direction).  For 
this  purpose,  we  introduce  a  small  parameter  e  and  define  the  large-scale  time  and  space 
variable 

t  =  e2r,  x  =  ey  (49) 

respectively  and  analyse  a  special  class  of  asymptotic  solutions  of  (48),  where  $(r,y)  = 
e(t,x )  is  expressed  in  the  large-scale  or  slow  variables  as: 

^€(t,x)  =  \I/(£,  x)  +  etyl(t,x,x/e)  +  e2\k2(£,x,x/e)  +  . . .  (50) 

One  can  derive  (see  [6]  for  details)  from  (48)  the  large  scale  modulational  equation  for 
\I/(£,x)  in  the  vorticity  form: 

dtV2*(t,x)  +  (51) 

(where  we  used  the  convention  Vj  =  ^-). 

The  coefficients  Vj^x  are  the  tensor  of  eddy  viscosity  and  are  the  effective  coef¬ 

ficients  of  another  tensor  which  we  call  the  nonlinear  a-tensor  (see  [6]  for  details).  Both 
tensors  are  derived  as  necessary  solvability  conditions  of  auxiliary  cell  problems  that  guar¬ 
antee  the  validity  of  the  separation  of  scales  for  some  finite  time. 

We  will  consider  a  family  of  cellular  flows  with  a  stream  function 

(j)  =  sin(y1)sin(y2)  +^cos(yi)cos(y2),0  <5  <  1 

All  coefficients  of  the  eddy  viscosity  tensor  Ujiki  but  one,  called  v9  can  be  computed 
analytically.  The  large-scale  modulation  equation  corresponding  to  v9  is: 

f)  Rp^ 

v2*  +  —  (V|  -  V?)[<5((V13')2  +  (V2$)2)  +  (1  +  ^2)Vi^V24']  +  +JXX(V,  V2*) 

Ot  o 

1  Rp  Rp 

=  Te^  -  — [(Vi  +  W2)2  +  ($Vi  +  V2)2]V2*  +  +(—(1  +  62)  +  i/)(V2  -  V?)®  (52) 

The  v '  can  be  computed  numerically  for  Re  <  32,  and  for  closed  cellular  flows  <f)  = 
sin(yx)  sin(y2)  it  can  be  shown  that  v!  =  0(Re 2-5)  for  large  Re.  This  is  done  using  an 
extension  of  the  variational  principles  discussed  earlier  in  this  lecture  (for  details  see  [6]). 
Previously,  Sivashinsky  and  Yakhot  ([7])  and  also  Dubrulle  and  Frisch  ([8])  have  done  a 
small  Reynolds  number  linear  stability  analysis  (see  [7]),  but  in  our  case  we  are  concerned 
with  large  Reynolds  number  flow. 

The  modulational  perturbations  of  closed  cellular  flows  (5  =  0  in  (5))  are  much  more 
stable  than  the  shear  cellular  flows  (<5  =  1  in  (5))  for  large  Reynolds  numbers.  More  specif¬ 
ically,  exponential  solutions  ^(t,x)  =  exp(crt)  exp(fciXi  +  fc2x2)  are  asymptotically  unstable 
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as  Re  — >  oo  only  if  k\  «  ±A~2  for  closed  cellular  flows.  This  result  is  to  be  contrasted  with 
a  similar  stability  result  for  shear  flows,  where  exponential  solutions  are  asymptotically 
unstable  as  R,e  — >  oo  if  C\  <  j/ci  1/1^2 1  £  C2  where  C\  —  I/C2  ~  0.45  ^  1.  It  can  also  be 
shown  that  because  of  the  presence  of  1/  —  0(Re2  b)  for  closed  cellular  flows,  the  stability 
at  high  Reynolds  numbers  is  significantly  better  for  flows  with  closed  streamlines.  Cell-like 

mesoscale  ocean  flows  (which  are  at  high  Reynolds  numers  in  the  range  of  10 - 103)  are 

close  to  closed  celluar  flows,  and  so  the  previous  analysis  may  explain  their  persistance. 

Notes  by  Ravi  Srinivasan ,  Dani  Zarnescu  and  Walter  Pauls. 
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1  Introduction 

Shallow  water  flows  in  channels  are  of  interest  in  a  variety  of  physical  problems.  These 
include  river  flow  through  a  canyon,  river  deltas,  and  canals.  Under  certain  conditions  we 
can  get  large  hydraulic  jumps,  or  their  moving  counterparts  bores,  in  the  channel.  There 
are  a  number  of  places  where  these  bores  are  generated  in  rivers  around  the  world,  including 
the  River  Severn  in  England,  and  the  Amazon  in  Brazil  [6]. 


Figure  1:  A  surfer  riding  a  tidal  bore  on  the  Amazon. 

In  this  work,  we  will  be  concerned  with  the  effect  that  geometry  and  flow  rate  have  on 
the  formation  and  stability  of  hydraulic  jumps.  The  general  setup  is  inspired  by  Al-Tarazi 
et  al.  [2]  and  Baines  and  Whitehead  [4],  The  motivations  are  to  use  the  present  study  to 
investigate  shallow  water  flow  and  also  as  a  tool  for  comparison  with  the  granular  media 
flows  studied  in  [2].  The  idea  being  that  this  will  lay  a  foundation  for  the  study  of  mixed 
media  flows. 
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The  body  of  this  report  is  divided  into  five  sections.  First  we  will  present  the  one 
dimensional  inviscid  hydraulic  theory.  Then  we  will  compare  the  inviscid  theory  with  the 
experimental  results.  Next  we  will  discuss  some  two  dimensional  and  frictional  effects. 
Finally,  we  will  discuss  some  areas  for  future  research  and  make  some  concluding  remarks. 

2  Experimental  setup 

We  conducted  a  series  of  experiments  in  a  linear  flume  with  a  flat  bottom  and  piecewise 
linear  cross  section.  Water  flows  through  a  sluice  gate  at  the  beginning  of  our  channel, 
pours  out  of  the  end  into  a  large  reservoir,  and  is  recirculated  using  pumps.  The  setup  is 
shown  in  Figure  2. 


Figure  2:  The  experiments  were  done  in  a  linear  plexiglass  flume,  where  water  was  recircu¬ 
lated  using  pumps  in  a  large  trash  can,  seen  on  the  right,  downstream  of  the  contraction. 

The  flume  had  a  20  cm  cross  section,  and  was  approximately  1.5  m  in  length.  When  all 
three  of  the  pumps  were  in  operation,  we  could  generate  a  volume  flux  up  to  4  liters/sec.  For 
each  experiment  two  plexiglass  paddles,  30.5  cm  long,  are  fixed  at  a  given  angle  at  the  end  of 
the  channel.  The  flow  rate  is  set  by  turning  on  the  desired  number  of  pumps  and  restricting 
the  flow  until  the  various  flow  states  are  observed.  The  flow  rate  is  then  measured  using  a 
bucket  and  a  stopwatch  at  the  end  of  the  channel.  In  order  to  increase  the  accuracy  of  our 
flow  measurement,  the  discharge  was  measured  a  minimum  of  five  times  and  the  mean  of 
these  measurements  was  taken  as  the  flow  rate.  In  each  experiment  the  height  of  the  fluid 
is  measured  by  placing  a  thin  ruler  in  the  fluid  parallel  to  the  flow  velocity  and  visually 
estimating  the  depth.  Data  was  taken  at  a  variety  of  nozzle  widths  and  flow  speeds.  A 
schematic  of  the  experimental  setup  is  shown  in  Figure  3. 
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150cm 

Figure  3:  A  sketch  of  the  tank  is  given.  The  tank  is  piecewise  linear,  with  paddles  at  the 
end  to  regulate  the  nozzle  width. 


3  One  Dimensional  Inviscid  Flows 

Here  we  will  present  the  mathematical  formulation  of  the  problem  of  flow  through  a  channel 
with  a  contraction.  We  will  derive  the  governing  equations  from  conservations  laws  and  use 
these  to  give  predictions  for  steady  one  dimensional  (1-D)  flows.  Next,  we  will  determine 
the  necessary  flow  conditions  for  moving  shocks,  as  well  as  for  stationary  shocks.  Finally, 
we  will  derive  a  stability  condition  for  steady  shocks  in  a  contraction. 


3.1  Conservation  Laws 


Conservation  of  mass  of  a  constant  density  fluid  in  a  shallow  channel  can  be  written  as 


<d 
d  t 


Xi  fb(x)  rh(x,y,t) 


rxi  rO{x)  r 
Jx  o  JO  Jo 


pdzdydx 


rb(x)  rh(x,y,t) 

=  /  /  p(u{x0,y,t)  -u(xi,y,t))dzdy,  (1) 

Jo  Jo 


where  the  rr-axis  is  measured  down  the  centerline  of  the  channel,  xo  and  x\  are  arbitrary 
points  on  this  axis,  and  t  is  time.  If  we  use  the  divergence  theorem  on  the  integral  on  the 
right  hand  side,  and  take  h  and  u  to  be  independent  of  y  this  becomes 

rx  i 

[ pb(x)h(x ,  t)]t  +  [ pb(x)h(x ,  t)u(x,  t)]x  dx  =  0.  (2) 


pXl 

Jx  0 


Since  xq  and  x\  are  arbitrary  we  get  that  the  argument  of  our  integral  must  be  equal  zero 
pointwise 

(bh)t  +  ( bhu)x  =  0.  (3) 

Here  u  is  the  velocity,  h  the  height  of  the  free  surface,  b  the  width  of  the  channel,  and  p  the 
density  of  the  fluid.  Partial  derivatives  are  written  in  two  ways  as  dt( •)  =  (•)*  and  so  forth. 
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For  water,  p,  is  taken  constant,  and  we  therefore  have  dropped  the  p  dependence  from  (3). 
We  can  also  get  a  momentum  equation  using  Newton’s  second  law  of  motion 

f  pu(x,y,t)dV  =  f  pu(x,y,t)2  •  ndA  +  f  FdV  +  (  S  •  neL4,  (4) 
dt  Jcv  Jcs  Jcv  Jcs 

where  d,A  and  dV  are  infinitesimal  area  and  volume  elements.  To  make  the  notation  simpler 
we  have  omitted  the  bounds  of  our  integrals,  instead  writing  CV  for  an  arbitrary  control 
volume,  and  CS  for  the  surface  of  that  volume.  If  we  make  the  assumption  that  the  pressure 
forces  are  hydrostatic,  then  the  acceleration  in  the  vertical  direction  is  negligible,  and  the 
body  forces  must  balance  the  surface  stress  to  give  hydrostatic  pressure  p  —  po  =  pg(h  —  y) 
(see,  e.g.,  [9]),  where  g  is  the  acceleration  due  to  gravity.  We  can  use  Stokes’  theorem  to 
turn  (4)  into  a  volume  integral,  and  after  assuming  again  h  —  h(x^t)^u  =  u(x,t)  we  obtain 

(bhv.)t  +  (bhu2)x  +  ^ gb(h2)x  =  0.  (5) 


3.2  Smooth  Hydraulic  Flow 

In  this  section,  we  are  looking  at  flows  which  have  reached  a  steady  state.  This  allows  us 
to  simplify  (3)  and  (5)  into 

( bhu)x  =  0  (6a) 

(bhv?)x  +  ^gb{h2)x  =  0.  (6b) 

When  the  solutions  are  smooth  we  can  expand  the  derivatives  in  (66)  to  get 

(2 1/2  +  9^)x  =  0.  (7) 

Next,  introduce  the  local  Froude  number,  F  =  u/y/gli.  Eliminating  ux  from  (6)  yields 

~(bh)x  +  bhx  =  0  (8) 

or 

(1  -  F2)bhx  =  F2hbx.  (9) 

Thus  we  see  that  if  F  =  1  then  b  must  be  stationary,  or  in  our  case  at  a  minimum.  Note 

that  the  converse  is  not  true,  when  bx  =  0  we  have  that  F  =  1  or  hx  =  0  but  not  necessarily 

both.  We  will  define  the  flow  to  be  subcritical  when  F  <  1  and  supercritical  when  F  >  1. 
Equation  (9)  tells  us  that  for  smoothly  contracting  6(x),  the  subcritical  fluid  flow  must  have 
a  minimum  in  h  at  the  nozzle.  Similarly,  supercritical  flow  must  have  a  maximum  at  the 
nozzle,  see  Figure  4. 

Next  we  will  examine  for  what  range  of  far  field  Froude  numbers  Fq  =  uo/\/gho  a^d 
contraction  ratios  B  =  6c/6o,  we  can  have  smooth  solutions.  Since  the  flow  is  smooth  we 
can  follow  the  two  constants  of  the  flow 

Q  =  boho'Uo  =  bchcuc  (10a) 

E  =  Uq/2  +  gliQ  =  it^/2  +  ghc.  (10b) 
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Side  view 


Subcritical  Flow 

Supercritical  Flow 

^ ^ 

- " 

\-  Top  view 


Figure  4:  The  shape  of  the  free  surface  of  supercritical  and  subcritical  smooth  flow  through 
a  nozzle.  Both  the  top  view  and  a  profile  are  shown  in  this  figure. 


If  we  non-dimensionalize,  H  =  hc/ho  and  B  =  6c/&o,  then  (10)  is  equivalent  to  the  cubic 
polynomial,  p(H)  =  0,  with  parameters  B  and  Fo,  where 

P(H)  =  H3B2  -  ( +  1)B2H2  +  \f$  =  0.  (11) 


0.1  0.2  0.3  0.4  0.5  0.6  0.7  0.8  0.9  1  1.1 

B 

c 


Figure  5:  The  smooth  solution  boundary  is  plotted  in  the  BcFo-plane,  where  Bc  is  the 
nondimensional  contraction  width,  and  Fq  is  the  ratio  of  the  flow  speed  to  the  characteristic 
flow  rate  of  the  fluid.  We  note  that  Bc  =  1  corresponds  to  a  blocked  flow  and  Bc  =  1 
corresponds  to  a  straight  channel. 

The  stationary  points  of  this  cubic  are  at  H  =  0  and  H*  =  5(5^0  +  1).  Now  since 
physically  meaningful  roots  exist  only  for  H  >  0,  we  can  determine  when  there  are  positive 
roots  by  evaluating  P(H )  at  H  —  H*.  When  p(H*)  <=  0  there  are  positive  roots,  and 
when  p(H*)  >  0  there  are  no  positive  roots.  Thus  the  point  p(H*)  =  0  determines  the 
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boundary  between  smooth  and  nonsmooth  solutions  in  the  Bc.Fo-plane.  This  is  a  standard 
technique  in  hydraulic  theory  [7].  We  obtain 


3  (Fo  \  2/3 

2  [bcJ 


-(i  +  ±f02)  =  o. 


(12) 


Figure  5  illustrates  how  for  a  given  geometry  Bc  and  incoming  depth  ho  there  is  a  maximum 
speed  at  which  a  subcritical  smooth  flow  can  pass.  It  is  also  interesting  to  note  that  for 
channels  with  expanding  width  B  >  1  we  can  find  a  smooth  flow  regardless  of  the  speed. 
In  the  next  section  we  will  look  at  non-smooth  flows. 


3.3  Upstream  Moving  Bores 

Here  we  will  look  for  solutions  with  a  discontinuity,  or  jump,  at  one  point.  We  will  allow 
this  jump  to  move  upstream  at  speed  s,  where  s  is  positive  when  moving  to  the  left,  as  in 
Figure  6. 
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Figure  6:  Both  the  free  surface  profile  and  the  planar  view  are  shown.  Here  we  have  a  shock 
moving  upstream  with  speed  s.  Conservation  laws  will  be  used  to  couple  the  fluid  motion 
between  points  xq,  x\ ,  and  xc. 

As  in  Figure  6,  we  will  pick  a  point  upstream,  xo,  one  between  the  jump  and  the  nozzle, 
X],  and  the  point  of  minimum  width  at  the  nozzle,  x^.  We  will  label  the  width,  height, 
and  velocity  at  these  points  with  subscripts  that  match  the  respective  points.  The  goal  is 
to  find  in  which  regions  of  the  j3cF()-plane  there  exist  shock  solutions.  We  can  couple  the 
flow  at  points  xq  and  xq  using  Bernoulli’s  equation  (13c)  and  conservation  of  mass  (136)  as 
before.  The  flow  at  points  xq  and  xq  can  be  coupled  by  conservation  of  mass  in  the  frame 
of  the  jump  (13a)  and  a  jump  condition  (13d)  which  we  can  derive  from  the  conservative 
form  of  the  momentum  equation.  Consequently,  we  have  four  equations  for  five  unknowns, 


102 


so  we  impose  the  restriction  that  flow  is  critical  (13e)  at  the  nozzle,  and  obtain  the  system 


(uq  +  s)hobo  —  (u\  +  s)h\b\  (13a) 

u\h\bi  =  uchcbc  (13b) 

1  1 

-uf+ghi  =  -u2c+ghc  (13c) 

(»o  +  »)2  =  +  (13d) 

u2c  =  ghc.  (13e) 


Taking  a  critical  condition  at  the  nozzle  is  a  common  assumption  in  hydraulics.  It  is 
equivalent  to  imposing  the  restriction  that  there  are  no  waves  at  infinity  [5].  Now,  we  have  a 
system  of  five  equations  for  five  unknowns  uc,  hC)  h\,s,  with  parameters  ho,  wo?  ^1  =  &0)  bc. 
Nondimension alizing  B  =  6c/6o,^o  =  uo/^gho,  Hi  =  hi/ho^S  =  s/y/gho ,  system  (13) 
simplifies  to 


i(F0  +  (l-tfi)S)2 
(F0  +  S)2 


3J/i2(Fo  +  (1~Hl)5)2/3  H 

1^(1  +  ^). 


(14a) 

(14b) 


Figure  7  shows  the  region  of  the  BcFo-plane  where  (14)  has  physically  meaningful  solutions. 
This  region  was  obtained  by  first  fixing  H\  and  then  finding  the  solution  curves  for  S  and 
then  fixing  S'  and  finding  the  solution  curves  for  H\.  The  boundaries  correspond  to  smooth 
flow  H\  =  1  and  steady  shocks  S  =  0. 
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3.4  Hydraulic  Jumps  in  the  Contraction 

Here  we  examine  for  which  flow  rates  Fo  and  contraction  widths  Bc  there  can  be  steady 
shocks.  Steady  shocks  in  a  contraction  are  solutions  to 


u0hob0  =  Uihibi 

=  U2h2bi  =  uchcbc 

(15a) 

l-u\  +  gho 

=  \u\  +  ghi 

(15b) 

1  2 

-u2  +  gh2 

1  2 

—  2Uc  +  9hc 

(15c) 

u! 

=  + 

(15d) 

ut 

=  ghc- 

(15e) 

These  seven  equations  are  mass  and  momentum  balance  between  four  locations  plus  the 
critical  condition.  The  four  locations  are  the  far  upstream,  ho^uo^bo,  the  upstream  limit 
of  the  shock  iai,/ii,6i,  the  downstream  limit  h2,U2,bi)  and  the  nozzle  uC)hc,bc.  If  we 
nondimensionalize  as  follows,  H\  =  hifh$,  #2  =  ^2/^0?  Bc  =  6c/6o,  then  we  can  reduce 
(15)  to 

\H'l  -  (\fS  +  1)J?!  +  ijBTjJifj  +  H'i 

We  will  use  (16)  to  find  where  in  the  FoB-plane  we  have  steady  shocks.  A  simple  way  to  do 
this  is  to  consider  what  the  boundaries  of  this  region  should  be.  If  we  have  a  shock  we  know 
from  the  energy  condition  that  H\  <  H2  [3].  Now  if  we  look  at  where  this  upper  bound  on 
H 1  is  satisfied  with  equality  Hi  =  H2,  we  can  then  reduce  (16)  to 

5f°2+i-!(|)2/s=o’  (i7) 

which  is  the  boundary  (12)  of  smooth  solutions  we  already  determined.  This  boundary 
came  from  considering  an  upper  bound  on  Hi.  The  other  boundary  should  then  come  from 
a  lower  bound.  Since  we  are  working  here  with  supercritical  flow  in  a  contracting  region, 
we  expect  Hi  to  grow  the  farther  we  move  into  the  contraction.  Thus  the  other  boundary 
should  be  when  the  shock  is  at  the  mouth  of  the  contraction,  or  when  H\  =  1.  Substituting 
this  into  equations  (16a)  and  (1 6b)  yields 

5  +  16 Fq  +  6(g)2/3  -  s/l  +  8-Fq  -  6(^)2/3  V1  +  8Fo  =  0.  (18) 

This  is  the  limiting  curve  we  found  for  moving  shocks  when  the  speed  goes  to  zero.  Thus 
we  have  steady  shocks  in  the  contraction  only  in  the  wedge  of  Figure  7  where  we  had  both 
smooth  solutions  and  upstream  moving  shocks. 

Next  we  examine  the  stability  of  steady  shocks.  Consider  a  system  with  a  steady  shock 
in  the  contraction  region,  with  u\  and  hi  the  upstream  limit  of  the  velocity  and  height  at 


=  0  (16a) 

=  0.  (16b) 
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Figure  8:  Sketch  of  the  unperturbed  solution  (solid  curve)  juxtaposed  over  the  perturbed 
shock  (dashed  curve). 

the  shock  and  I12  the  downstream  limit.  Now  we  will  assume  a  small  perturbation  which 
generates  a  shock  moving  at  a  small  speed  6*,  see  Figure  3.4. 

Let  us  present  equations  which  govern  the  perturbed  flow.  The  perturbations  are  de¬ 
noted  with  a  superscript  e.  The  perturbed  flow  balances  mass  and  momentum  over  the 


shock 

(u\  +  u\  +  s)(hi  +  h\)  =  {'U2  +  ue2  +  s)(h,2  +  h\)  (19) 

(?/]  +  u\  +  s)2(h]  +  h\)  +  ^(/i]  +  h\)2  =  ( U2  +  u\  +  s)2{h  2  +  h\)  +  ^(^2  +  •  (20) 

Mass  will  be  conserved  upstream  of  the  jump 

('Uj  +  u\)(b  +  bc)(hi  +  h\)  =  Q.  (21) 

We  will  assume  that  the  perturbation  does  not  affect  the  far  field  momentum  upstream  Ej 
or  downstream  so  the  Bernoulli  constants  are  unchanged 

-{it,}  +^j)2  +  g{h\  +/ij)  =  Ei  =  -Ui  +  gh]  (22) 

-(u2  +  U2)2  +  g{h  2  +  /4)  =  E2  =  -v-2  +  gh  2-  (23) 

Now  we  assume  that  we  have  a  small  perturbation  and  small  resulting  shock  speed.  Lin¬ 
earizing  all  these  equations  gives  a  linear  system  of  six  unknowns  and  five  equations 

u\  /i  j  b  +  U\ h\  be  +  u\ bh\  =  0  (24a) 

u\h\  +  sh\  +  u\h\  —  v\}i2  —  sfi2  —  U2I12  =  0  (24b) 

2/ti  U]  (x/.j  +  s)  +  h\u*  +  gh\  h\  —  2/1^2  (*4  +  s)  —  —  <7/12^2  =  0  (24c) 

u\u\  +  gh\  —  0  (24d) 

U2U2  +  5/4  =  0.  (24e) 


The  goal  is  to  reduce  this  to  a  single  equation  for  the  perturbed  shock  speed  s  in  terms  of 
the  change  in  channel  width  bc.  If  bc  and  s  have  opposite  signs  and  we  are  in  a  contraction, 
then  the  solution  is  stable,  because  the  shock  speed  will  force  the  shock  back  to  its  previous 
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Unstable 


Stable 


Figure  9:  A  sketch  of  the  contraction  region  as  viewed  from  above.  The  solid  vertical  lines 
denote  shocks.  In  a  contracting  region  steady  shocks  are  unstable,  while  in  a  dilation  they 
are  stable. 


position.  If  instead  they  have  the  same  sign,  then  it  is  unstable,  because  the  shock  will 
propagate  away  from  its  previous  location,  as  shown  in  Figure  9. 


After  some  algebra  we  obtain  the  relationship 


5  = 


(1-fe) 


(25) 


Here  S  =  s / y/ghi  and  Be  =  i>e/6o*  We  know  that  across  a  physical  shock,  h%  >  h\  and 
also  that  u<i  <  U\,  so  this  gives  that  sign(S')— sign(Be).  Details  of  the  above  calculation  are 
found  in  an  appendix  to  this  work  [1]. 


4  Results 

Experiments  were  conducted  over  a  variety  of  flow  speeds  and  channel  geometries.  In  each 
experiment,  the  contraction  width  Bc  was  set  and  then  the  flow  rate  Q  was  varied.  The 
inflow  height  was  kept  fixed  for  all  experiments  at  ho  =  1.3cm.  The  upstream  channel 
width  was  also  a  constant  bo  =  20cm.  For  each  flow  rate  we  recorded  the  category  of  the 
flow,  either  smooth  flow,  moving  shock,  steady  shock,  or  oblique  shock.  These  flow  state 
are  depicted  in  Figure  10.  For  moving  shocks  the  speed  and  height  ratios  across  the  shock 
were  measured.  There  is  an  experimental  difficulty,  in  that  we  cannot  measure  the  speed  of 
fast  moving  shocks.  When  measuring  a  flow,  there  is  a  time  delay  between  when  we  initiate 
the  flow  and  when  it  reaches  a  steady  state.  In  this  experiment  the  time  delay  is  on  the 
order  of  five  seconds.  Thus  for  flows  with  shock  speeds  larger  than  15  cm/sec,  the  shock 
will  move  to  the  end  of  our  channel  before  we  can  properly  measure  the  speed.  The  data 
for  the  moving  shocks  where  we  could  measure  both  the  speed  and  the  height  are  found  in 
Table  1. 

In  addition  to  measuring  the  speed  of  moving  shocks,  we  also  took  measurements  when 
we  had  oblique  shocks  in  the  flow.  Oblique  shocks  are  a  stationary  phenomena  in  our  flow, 
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Figure  10:  Shown  are  sketches  of  the  four  types  of  flow  behavior.  Each  sketch  shows  a 
profile  of  the  flow  and  a  planar  view. 


Bc  =  bc /b0 

Fq  =  Uq/ y/gho 

//]  =  hi /ho 

S  =  s/y/gh/) 

0.6 

3.19 

3.54 

0.15 

0.6 

3.55 

3.77 

0.08 

0.7 

2.31 

2.54 

0.11 

0.7 

2.40 

2.85 

0.19 

0.7 

2.49 

2.69 

0.05 

0.7 

2.80 

3.08 

0.08 

0.7 

2.98 

3.23 

0.02 

0.8 

2.10 

2.31 

0.08 

0.81 

2.20 

2.62 

0.09 

0.875 

2.07 

2.23 

0.06 

Table  1:  The  experimental  data  for  moving  shocks  are  presented  here.  Bc  is  the  nondimen- 
sional  contraction  ratio;  Fq  the  upstream  Fronde  number;  H\  the  nondimensional  height 
ratio  across  the  shock;  and  S  the  nondimensional  shock  speed. 
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so  we  do  not  have  the  difficulty  of  measuring  speed  as  in  the  moving  case.  There  is  a  new 
difficulty.  When  oblique  shocks  are  very  weak,  surface  tension  effects  will  become  impor¬ 
tant,  and  rather  than  a  shock,  we  see  capillary  waves  in  our  contraction  region.  A  picture 
of  this  phenomena  is  shown  in  Figure  11. 


Figure  11:  Weak  shocks  can  be  distorted  by  capillary  waves. 

Since  we  only  want  to  measure  oblique  shocks,  we  need  a  criterion  to  determine  when 
we  have  an  oblique  shock  and  when  we  have  capillary  waves.  The  criteria  used  here  is  that 
when  there  is  a  measurable  height  difference  between  the  fluid  upstream  and  downstream 
of  the  front,  we  call  it  an  oblique  shock.  When  the  mean  fluid  height  is  the  same  on  both 
side  of  the  front  we  call  it  a  capillary  wave.  The  data  from  the  oblique  shocks  we  measured 
are  presented  in  Table  2. 


o 

ii 

Fo  =  uo/Vgho 

Be 

Os 

1.9 

2.79 

9.5 

26.7 

1.5 

2.94 

3.8 

26.7 

1.7 

3.13 

9.5 

27.1 

1.5 

3.32 

3.8 

21.6 

1.5 

3.37 

5.7 

22.1 

1.9 

3.47 

7.6 

25.4 

1.7 

3.56 

9.5 

20.1 

1.8 

3.65 

7.6 

25.2 

Table  2:  The  experimentally  measured  flow  variables  for  the  oblique  shocks  are  presented 
here.  Hi  is  the  nondimensionalized  height  ratio  across  the  shock,  Fo  is  the  upstream  Froude 
number,  9C  is  the  angle  of  the  contraction,  9S  is  the  angle  of  the  shock,  see  also  Figure  10. 
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We  measured  the  flow  rate  and  geometry  for  every  experiment.  The  different  flow  types 
are  plotted  in  the  UcFo-plane  in  Figure  12. 


Figure  12:  Experiments  plotted  in  the  predicted  inviscid  state  space.  Here  circles  are  oblique 
shocks,  diamonds  are  steady  shocks,  squares  are  moving  shocks,  pluses  are  smooth  flows. 
Representative  error  bars  are  plotted  on  a  smooth  flow  and  a  moving  flow  at  Bc  =  0.875. 
The  thick  lines  arc  the  numerically  computed  boundaries  for  the  regions  where  an  upstream 
moving  shock  can  be  stopped  via  friction. 
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5  Discussion 


In  this  section  we  will  compare  the  experimental  results  to  mathematical  theory.  The  models 
used  in  this  report  assume  no  effects  of  surface  tension  or  viscosity.  The  importance  of 
these  effects  are  commonly  measured  using  nondimensional  numbers,  Weber  We  for  surface 
tension  and  Reynolds  Re  for  viscosity.  Table  3  shows  the  range  for  these  parameters  in  the 
experiments  considered. 


Parameter 

Min 

Max 

Reynolds  (Re  =  UL/v) 
Weber  (We  =  pU2L/a ) 
Froude  (F0  =  U0/\/gho) 
Contraction  (Bc  =  bc/bo) 

1,300 

1.85 

0.28 

0.6 

20,100 

433 

4.34 

0.875 

Table  3:  The  nondimensional  parameters  are  estimated  for  the  main  body  of  our  flow.  Here 
We  &  14 Fq  and  Re  &  3300To.  If  we  look  at  some  local  phenomena,  for  instance  near  weak 
oblique  shocks,  we  can  have  smaller  Weber  and  Reynolds  numbers. 


If  we  look  at  Equation  (14)  we  see  that  for  a  given  upstream  Froude  number  Fq  and 
nondimensional  shock  speed  S',  we  can  predict  the  height  ratio  across  the  shock.  We  can  then 
compare  this  prediction  to  the  measured  height  ratios  across  the  jump.  This  comparison  is 
shown  in  Figure  13. 


5.1  Oblique  Shocks 

All  the  analysis  at  the  beginning  of  this  report  considered  only  1-D  phenomena.  Oblique 
shocks  are  a  two-dimensional  (2-D)  phenomena,  so  our  model  does  not  take  them  into 
account.  Following  [2]  and  [8],  we  can  derive  a  system  of  equations  for  the  oblique  shock 
angle  6S  and  shock  height  hi.  These  equations  will  allow  us  to  predict  0S  and  h\  given  the 
upstream  conditions  ho,  To  and  the  angle  of  the  contraction  0C,  as  follows 

(26a) 

ho  tan(0s  -  6C ) 


sin  Qs 


£<i+£>. 

ho  ho 


(26b) 


Using  (26)  we  can  plot  our  predicted  oblique  shock  angles  against  the  experimental  ones. 
This  plot  is  shown  in  Figure  14. 


In  our  experiments  we  saw  oblique  shocks  that  exit  our  channel  before  interacting  with 
another  shock,  and  oblique  shocks  that  intersect  in  the  channel,  see  Figure  15.  A  similar 
calculation  was  also  done  which  can  be  used  to  predict  the  angles  of  intersecting  oblique 
shocks. 


Ill 


Figure  13:  The  experimental  height  ratios  are  plotted  against  the  inviscid  predictions.  The 
dotted  curve  (diamonds)  corresponds  to  the  experimental  measurements;  the  dashed  curve 
(triangles)  to  the  inviscid  predictions.  The  experimental  measurements  are  systematically 
lower  than  the  predicted  curve  due  to  the  effect  of  friction. 
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Figure  15:  The  left  image  is  a  single  oblique  shock  in  an  asymmetric  contraction,  Fo  = 
3.56,  Bc  =  0.75.  The  right  image  shows  the  intersection  of  two  oblique  shocks  in  a  symmetric 
contraction,  Fq  =  3.65,  Bc  =  0.7. 
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5.2  Turbulent  Drag 

If  we  examine  Figure  12  we  see  that  there  are  steady  shocks  outside  of  the  region  predicted 
by  the  inviscid  theory.  To  explain  this,  we  must  reexamine  the  assumptions  of  the  original 
model.  The  real  fluid  does  have  some  viscosity,  so  we  should  take  this  into  account.  In  the 
hydraulic  equations,  viscous  effects  are  usually  added  with  a  drag  term.  The  most  physical 
drag  term  comes  from  a  quadratic  drag  law  [3].  Adding  such  a  term  changes  the  hydraulic 
equations  (6)  to  the  system 


uux  +  ghx  =  ~CdV2/h  (27a) 

{buh)x  =  0.  (27b) 

This  system  of  ordinary  differential  equations  (ODE’s)  can  now  be  solved  using  standard 
numerical  techniques.  When  we  do  not  have  smooth  solutions  we  can  find  steady  shocks 
by  using  the  inflow  conditions  at  the  sluice  gate  and  critical  condition  at  the  nozzle  as 
boundary  conditions  to  march  the  solutions  together  until  they  match  with  a  shock. 


< - > 

150cm 


Figure  16:  A  cartoon  of  the  numerical  method  for  finding  steady  shocks.  We  use  an  ODE 
solver  to  find  the  smooth  flow  with  a  prescribed  upstream  Froude  number  Fo  and  the  smooth 
flow  that  meets  the  critical  condition  Fc  =  1.  These  two  smooth  flows  are  then  matched 
using  the  shock  condition. 

Depending  on  the  Froude  number  Fo  and  geometry  Bc,  we  may  or  may  not  be  able  to 
have  a  steady  shock  of  this  type.  If  we  solve  this  system  throughout  our  state  space  we  get 
numerically  computed  boundaries  for  when  we  can  have  steady  shocks  with  friction.  These 
boundaries  are  plotted  in  Figure  12.  For  our  computations  we  have  used  Cd  =  0.004  [4], 

5.3  Multiple  States 

In  our  inviscid  calculations,  we  predicted  a  region  in  the  FcFo-plane  where  we  can  have 
three  different  steady  states:  steady  shocks,  moving  shocks,  and  supercritical  smooth  flows. 
We  have  shown  that  the  steady  shocks  in  the  contraction  region  are  unstable,  so  we  don’t 
expect  to  see  these.  We  also  have  observed  that  friction  can  stop  slowly  moving  shocks,  and 
that  supercritical  smooth  flows  correspond  to  oblique  shocks.  Thus  this  region  of  multiple 
states  really  corresponds  to  flow  speeds  where  we  can  have  upstream  steady  shocks,  stopped 
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via  friction,  and  oblique  shocks  in  the  contraction  region.  These  phenomena  were  observed 
in  the  lab.  Figure  12  shows  both  oblique  shocks  and  steady  shocks  in  the  same  region  of 
state  space.  We  also  observed  that  large  perturbations  of  these  flows  can  cause  the  flow 
to  change  from  one  steady  state  to  another.  If  we  have  a  steady  upstream  shock,  we  can 
physically  push  most  of  the  water  that  is  behind  the  shock  out  of  the  channel,  and  see  a 
steady  oblique  shock.  If  we  have  an  oblique  shock,  we  can  block  the  flow  for  a  small  time 
period,  and  the  resulting  flow  will  evolve  into  a  steady  upstream  shock.  Figure  17  shows 
snapshots  of  the  transition  from  oblique  shocks  to  an  upstream  steady  shock. 


Figure  17:  Shown  are  snapshots  of  the  flow  transition  from  oblique  shocks  to  a  steady 
upstream  shock.  The  time  interval  between  each  frame  is  1  second.  Here  we  have  the 
Proude  number  Fq  =  2.8  and  the  contraction  ratio  Bc  =  0.7.  A  ruler  is  used  to  restrict  the 
flow  for  a  small  time  period  to  induce  this  state  change. 


6  Conclusion  and  future  work 

We  presented  a  mathematical  and  experimental  investigation  into  shallow  water  flow  through 
a  contraction.  We  began  by  making  predictions  using  a  simple  1-D  inviscid  theory.  We  saw 
that  for  slow  speeds  this  1-D  analysis  performs  well.  For  higher  speeds  boundary  drag  be¬ 
comes  important  and  we  saw  a  departure  from  the  1-D  inviscid  predictions.  The  addition 
of  drag  forces  improved  the  performance  of  the  1-D  theory.  To  predict  oblique  shocks,  a 
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fundamentally  2-D  feature,  we  had  to  use  the  2-D  shallow  water  equations. 

We  also  set  out  to  investigate  the  existence  and  stability  of  steady  shocks.  We  presented 
a  perturbation  method  for  finding  when  steady  shocks  in  the  contraction  are  stable.  Ex¬ 
perimentally  we  observed  that  shocks  which  are  stopped  via  friction  are  stable.  If  we  look 
at  how  the  shock  speed  depends  on  flow  rate,  see  Figure  7,  we  see  that  there  is  a  heuristic 
stability  result  which  agrees  with  our  predictions  and  observations.  When  the  flow  rate  is 
increased,  shocks  move  slower,  and  when  the  flow  rate  is  decreased  shocks  move  faster.  If 
we  apply  this  knowledge  to  a  steady  shock,  we  see  that  in  accelerating  flows,  steady  shocks 
will  be  unstable.  If  we  displace  a  steady  shock  upstream  in  an  accelerating  flow  it  will 
have  a  faster  speed,  and  will  move  upstream.  Also  downstream  displacements  will  generate 
slower  speeds,  and  the  shocks  will  move  downstream.  A  similar  argument  shows  that  in 
decelerating  flows  steady  shocks  are  stable.  This  argument  is  incomplete  however,  in  that 
it  does  not  deal  with  flows  where  the  velocity  is  not  monotone.  This  is  precisely  the  case  of 
a  steady  shock  in  a  contraction,  so  here  we  used  the  perturbation  method  of  [4]. 

There  are  a  variety  of  avenues  for  future  research  illuminated  by  the  experiments  and 
analysis  presented  here.  First,  we  have  observed  that  supercritical  flows  that  are  1-D  smooth 
have  additional  2-D  shock  structure  which  is  not  accounted  for  with  1-D  theory.  In  the  ap¬ 
pendices  of  this  report  [1]  we  have  predictions  for  some  of  these  2-D  structures.  We  also 
observed  a  structure  like  a  Mach  stem  near  the  intersection  of  two  oblique  shocks.  This 
structure  has  not  been  accounted  for  in  the  work  presented  in  this  report.  Another  avenue 
for  future  research  is  to  use  this  work  in  conjunction  with  [2]  as  a  base  for  investigating  shal¬ 
low  flow  of  composite  media,  i.e.  water  carrying  sediment.  Also  this  report  does  not  include 
analysis  of  the  time  dependent  problem.  Here  we  could  investigate  the  relationship  between 
initial  data  and  steady  state  in  the  region  of  multiple  steady  states.  Future  work  is  currently 
being  done  to  compare  2-D  simulations  with  experimental  results.  A  few  experiments  have 
been  done  on  Mach  stems  and  adding  polystyrene  beads  to  simulate  granular  media.  For  up¬ 
dates  on  the  current  state  of  the  work,  see  http://www.math.wisc.edu/~akers/contraction. 
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Figure  18:  Here  are  snapshots  of  an  experiment  where  polystyrene  beads  are  impulsively 

dumped  into  a  flow  that  exhibits  three  states:  upstream  shocks,  Mach  stems,  and  oblique 

shocks.  The  flow  begins  in  the  state  of  oblique  shocks,  with  Fo  =  3.08,  Bc  =  0.7.  Beads  are 

dumped  into  the  flow,  and  the  resulting  flow  is  an  upstream  shock. 
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Bounds  on  Multiscale  Mixing  Efficiencies 
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1  Introduction 

We  are  motivated  by  the  following  problem:  given  the  heat  sources  and  sinks  on  the  Earth  and 
a  stirring  field  with  certain  statistical  properties  what  are  the  maximum  and  minimum  possible 
degrees  of  temperature  variance  that  must  be  present?  The  flow  in  the  Earth’s  atmosphere  is  an 
example  of  a  stirring  field  which  acts  to  redistribute  heat.  Here  we  are  interested  in  the  optimal 
solution  to  this  redistribution  problem.  We  expect  any  stirring  would  suppress  fluctuations 
in  the  temperature  field  and  lead  to  a  more  uniform  distribution,  but  what  kind  of  stirring 
minimizes  the  variance?  Can  we,  given  some  bulk  statistical  properties  of  the  stirring  field, 
derive  bounds  on  the  variance?  What  are  the  characteristics  of  a  good  stirring  field  and  how  do 
they  depend  on  the  source  distribution?  Can  we  define  and  estimate  the  eddy  diffusivity  and 
mixing  efficiency  of  a  stirrer  on  different  length  scales?  Here  we  work  toward  answering  these 
questions  by  computing  rigorous  bounds  on  multiscale  mixing  efficiencies. 

Bounds  on  mixing  have  important  implications  in  both  physics  and  engineering.  Thiffeault, 
Doering,  and  Gibbon  [5],  hereafter  TDG,  have  shown  how  techniques  used  to  bound  bulk  dissi¬ 
pation  quantities  in  the  N a, vier- Stokes  equation  [2]  can  be  applied  to  the  advection-diffusion  of  a 
passive  scalar  maintained  by  a  steady  source.  They  derived  rigorous  bounds  on  the  scalar  vari¬ 
ance  and  defined  an  equivalent  diffusivity ,  the  diffusivity  required  to  produce  the  same  amount  of 
mixing  in  the  absence  of  stirring.  Plasting  and  Young  [4],  hereafter  PY,  enhanced  that  analysis 
by  including  the  variance  dissipation  as  a  constraint. 

Here  we  construct  bounds  on  the  multiscale  mixing  efficiency  of  a  sitrring  field  for  a  passive 
scalar  maintained  by  a  time  independent  but  spatially  inhomogeneous  source.  We  focus  on 
the  mixing  efficiency  of  a  stirring  field  on  different  scales  by  considering  the  fluctuations  of  the 
variance,  gradient  variance,  and  inverse  gradient  variance.  Comparing  the  three  measures  (the 
variance,  gradient  variance,  and  inverse  gradient  variance)  gives  a  range  of  information  about 
the  stirring  properties  of  a  flow.  It  has  been  recognized  that  Lp  norms  of  the  passive  scalar 
fail  to  quantify  the  stirring  efficiency  of  a  mixing  process  because  they  are  insensitive  to  small 
scale  structures  [1].  We  gauge  the  effectiveness  of  a  stirring  field  based  on  its  ability  to  suppress 
variance  relative  to  the  variance  in  the  absence  of  stirring.  On  all  scales,  a  smaller  variance 
implies  the  velocity  field  is  a  better  stirrer.  Thus,  a  velocity  field  that  gives  a  larger  mixing 
efficiency  than  another  will  be  considered  more  efficient  with  respect  to  a  particular  measure. 

Our  approach  for  bounding  the  multiscale  mixing  efficiencies  follows  that  of  TDG.  The 
bounds  are  derived  in  section  3.  In  section  4  we  show  that  the  bounds  for  a  monochromatic  source 
on  the  torus  can  be  saturated.  In  an  effort  to  distinguish  the  three  measures  more  convincingly, 
in  section  5  we  investigate  the  efficiency  of  a  steady  shearing  flow  for  a  monochromatic  source 
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using  boundary  layer  asymptotics.  Finally,  in  section  6,  we  find  bounds  on  the  multiscale  mixing 
efficiencies  for  a  decaying  passive  scalar  maintained  by  a  spatially  inhomogeneous  source. 


2  Advection-diffusion  in  a  space  of  ^-dimensions 


The  advection-diffusion  equation  of  a  passive  scalar  with  a  body  source  s(x)  in  a  d-dimensional 
domain  (d  =  2,  3,  ...)  is: 

r)@ 

—  +  u  •  V#  =  kA6  +  s(x)  (2.1) 

where  s(x)  is  a  source  with  spatial  mean  zero  without  loss  of  generality  and  k  is  the  molecular 
diffusivity.  Since  the  body  source  has  spatial  mean  zero  the  passive  scalar  will  also  have  zero 
spatial  mean.  The  velocity  field  here  is  a  given  steady  or  time-dependent  L2  divergence- free 
vector  field.  The  velocity  field  could  be  a  solution  of  the  Navier-Stokes  equation  or  a  speci¬ 
fied  stochastic  process.  We  will  focus  on  a  particular  class  of  stirring  fields  that  are  steady, 
statistically  homogeneous,  and  isotropic  with  single  point  statistical  properties  characteristic  of 
Homogeneous  Isotropic  Turbulence  (HIT): 


and 


U 2 


Ui(x,  *)  =  0,  Ui(x ,  • )uj(x ,  •)  =  —  Sij 


dui(Xj  *)  duj(x,  *)  __  T2 


n,.(v  ')  —  n  L  —  1  A 

1  ’  dxk  5  dxk  dxk  d 


(2.2) 


(2.3) 


where  where  C/2  :=  (|u|2)  is  the  kinetic  energy  density,  A  =  U /T  is  the  so-called  Taylor  microscale 
for  HIT,  and  the  overbar  denotes  the  steady  average  defined  below.  Let  us  define  the  advection- 
diffusion  operator  and  its  formal  adjoint: 


C  :=  dt  +  u  •  V  —  kA,  C)  —dt  —  u  •  V  —  kA. 


(2.4) 


We  also  define  the  steady  average  (assuming  it  exists), 


F(x) 


1 

lim  -  /  F{x:tr)dtf ) 
t—> °o  t  Jo 


(2.5) 


and  the  space  time  average, 

(F)  :=±J  F(x)ddx.  (2.6) 

From  here  on  our  domain  will  be  a  periodic  box  of  size  L,  i.e.  x  E  Trf,  the  d-dimensional  torus. 
The  Fourier  decomposition  of  the  spatially  dependent  variables  are  written  conventionally  as 

Fht,t)  =  YFk  xF(k)  where  F(k)  = -^  f  e~ik'x.F(x,t)  ddx  (2.7) 

L  dTd 


and  k  =  (27r/L)n  for  n  =  (711,  ...,n<i).  The  L2  norms  of  derivatives  of  the  passive  scalar  will  be 
denoted,  for  example, 

(|V^|2)  =  J^|0(k)|2.  (2.8) 

k 
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The  conventional  non-dimensional  number  measuring  the  relative  importance  of  advection  to 
diffusion  is  the  Peclet  number 


K 


(2.9) 


A  standard  measure  of  the  well-mixedness  of  a  scalar  field  is  its  variance  (presuming  spatial 
mean  zero)  [1].  Weighting  the  variances  at  different  length  scales  introduces  a  family  of  variances 
that  are  sensitive  to  mixing  on  different  scales.  The  variances  (|VP0|2)  for  p  =  1,0,  —1  measure 
mixing  on  small,  intermediate,  and  large  scales  respectively.  To  gauge  the  effect  of  stirring,  the 
variances  are  compared  with  variances  in  the  absence  of  stirring  (\Vp9q\2)  =  «"2(|VPA“15|2) 
for  p—  1, 0,  —1. 

We  define  a  non-dimensional  mixing  efficiency  for  each  scale  p 


” '  (ivp#!2) 


for  p  =  1,0,  —  1 


(2.10) 


which  increases  as  stirring  increases.  The  mixing  efficiency  has  the  advantage  of  depending  only 
on  the  structure  of  the  stirring  and  source  and  not  on  their  scales. 

The  equivalent  diffusivity  is  the  equivalent  amount  of  diffusivity  required  to  achieve  the 
same  degree  of  mixing  in  the  absence  of  stirring  (i.e.  a  corresponding  diffusivity  for  the  diffusion 
equation)  and  is  defined  as 


(IV^ol2)172 

(|VP0|2)l/2  ' 


(2.11) 


Equivalent  diffusivity  should  not  be  confused  with  effective  diffusivity  defined  in  homogenization 
theory.  The  effective  diffusivity  is  defined  in  terms  of  large-scale  transport,  i.e.  in  the  presence 
of  large  scale  gradients  of  the  concentration  (G.  Papanicolaou  lectures)  [6].  The  equivalent 
diffusivity  is  specific  to  the  source  and  stirring.  Lower  bounds  on  the  variances  provide  upper 
bounds  on  the  mixing  efficiencies.  These  bounds  depend  on  details  of  the  source  and  stirring  as 
shown  in  the  next  section. 


3  Bounds  on  the  multiscale  mixing  efficiencies 

Following  the  method  developed  by  TDG  we  derive  bounds  on  the  multiscale  mixing  efficiencies. 


3.1  Bounds  on  variance 


Multiplying  (1)  by  a  smooth,  time  independent,  spatially  periodic  projector  function  </?(x), 
taking  the  space-time  average  and  integrating  by  parts  we  obtain 

(0(u  •  V  +  *A)tp)  =  -{<ps).  (3.1) 


A  lower  bound  on  the  variance  is  achieved  via  the  variational  principle 

(62)  >  max  min  {{02)  |  (6( u  *  Vy?  +  nA<p))  =  —  (ips)}.  (3.2) 

v  e 


We  note  that  TDG  derived  bounds  on  the  mixing  efficiency  without  optimizing  over  ip.  The 
optimization  over  0  is  equivalent  to  applying  the  Cauchy- Schwarz  inequality 


(02)  >  max 


(^■)2 

((u  •  V(/5  +  kA if)2) 


(is)2 

<i££V)' 


(3.3) 
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Maximizing  over  ip  is  equivalent  to  minimizing  the  denominator  over  ip.  We  constrain  ip  to  have 
a  unit  projection  onto  the  source.  The  corresponding  variational  problem  is: 

•py  <  njn  {(<pCChp)  \  (<ps)  =  1}.  (3.4) 

Thus  we  must  minimize  the  functional  T  :=  —  ^{}ps  —  1)^,  whose  Euler-Lagrange 

equation  is 

c  sp  _ 

—  =  Ctfip-  ps  =  0  (3.5) 

dip 


where  fi  is  a  Lagrange  multiplier  to  enforce  the  constraint.  The  minimizer  is  (after  some  algebra) 


_  MpS 
^  ( sMqs ) 


(3.6) 


where  :=  {C.O)  1  and 

(sM0s)  =  (s{«2A2  -  V  •  (uu)  +  k(2Vu  :  VV  +  Au  •  V)}_1s).  (3.7) 

Thus  we  obtain  the  lower  bound 

(I e 2)  >  (sMqs)  (3.8) 


which  depends  only  on  the  mean  and  quadratic  correlations  of  the  stirring  field.  For  flows 
satisfying  HIT  this  simplifies  to  the  quadratic  form 

(sA4os)  =  (s{k2A2  —  ( U2 /d)A}~1s ).  (3.9) 


This  lower  bound  on  the  variance  depends  on  the  source  function  and  on  the  stirring.  In 
Fourier  space  it  is  expressed  as 


/02v  >v— J!ML_ 

[  ^  Kk4  +  U2k2/d' 


(3.10) 


An  upper  bound  on  the  variance  may  be  obtained  from  simple  application  of  the  Cauchy- 
Schwarz  and  Poincare  inequalities  to  the  bulk  variance  dissipation  constraint: 


(02>< 

The  variance  in  the  absence  of  stirring  is 


L2  (IV-^I2) 


47 r2 


K 


(3.11) 


{el)  =  3((a-V) 


K 


and  thus  we  obtain  bounds  on  the  mixing  efficiency  from  bounds  on  the  variance 

i-o-\i2  \  -1 

Em 


^  (1A-^|2)  <  2  <  V 

l2  (iv-^i2)  -  M°  -  \E 


|s(k)|5 


k4 


l*(k)l2  V 

k4  +  Pe2k2/d  I  ' 


(3.12) 


(3.13) 
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One  might  expect  the  variance  efficiency  to  have  a  lower  bound  of  1,  implying  that  stirring 
always  decreases  the  variance.  We  can  search  for  a  sharper  lower  bound  by  actually  optimizing 
the  variance  subject  to  the  variance  dissipation  constraint 

(62)  >  min  {(62)  |  k(|V0|2)  =  <s(9)}.  (3.14) 

e 


That  is,  we  can  seek  the  minimum  possible  variance  subject  only  to  the  entropy  production 
balance.  The  solution  of  the  optimization  problem  in  Fourier  space  is 


*10 = i-  i(k> 


2  Kk 2  +  A 

where  A  =  —1/2//  and  //  is  the  Lagrange  multiplier.  Enforcing  the  constraint, 

k2m\2  ov  ij^i2 
,uw*w  =*  =2T 

k 


K 


]>>2|0(k)|2  =  ^0(k)s*(k)  => 


(Kk2  +  A)2 


Kk2  +  A  ’ 


where  *  denotes  the  complex  conjugate. 

In  the  case  of  a  monochromatic  source  this  simplifies  to 


k2 


=  2- 


1 


nk t 


■ .  ,9  wo  —  “  io  ,  ^  2  —  2(Kk2  +  A)  =>  A  —  — — 

(nk2  +  A)2  Kk2  +  A  v  '  2 


and  hence 


(3.15) 


(3.16) 


(3.17) 


0(k)  =  ^2  =  *o(k) 


(3.18) 


which  implies  Mo  >  1. 

In  the  case  of  a  dichromatic  source  (Aq  with  amplitude  6q,  h>2  with  amplitude  52),  the  con¬ 
straint  requires  one  to  solve  a  cubic  equation  for  £  =  A/Aq 


(1  +  T  -(1  +  ®ft  T  4 ft  +  Aof)£0  +  (/3  +  ocft  +  ftft  +  cr)£  +  —  (ft^  +  o/?)  —  0  (3.19) 

where  a  =  C1/C2,  ft  =  //1///2,  cj  =  C2  —  52,  //1  =  acA*},  and  /X2  =  ftA^.  Then  the  mixing 
efficiency  is 


|flo(k)|2  =  4(*  +  ffQ 

l^(k)P  0W+(/3+fF 


(3.20) 


The  efficiency  goes  to  1  in  the  monochromatic  limit  a  — »  0  (£  — »  —1/2)  as  expected.  But  the 
minimum  value  the  efficiency  bound  is  less  than  1  for  V  £  implying  that  the  variance  dissipation 
constraint  is  not  sufficient  to  guarantee  that  there  are  no  stirring  flows  that  could  possibly 
increase  the  scalar  variance.  So  this  analysis  does  not  rule  out  the  existence  of  ineffective 
stirring  fields. 


3.1.1  Delta  function  source 

Here  we  consider  a  ^-function  point  source  (measure  valued)  with  Fourier  coefficients  \s(k)\  =  1 
as  |k|  — >  00.  We  note  this  includes  white  noise  sources.  As  we  are  interested  in  the  high-Pe 
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limit  we  approximate  the  sums  in  (3.16)  by  integrals.  The  asymptotic  form  of  the  upper  bound 
is 


Mo  < 


00  kd~1dk\  (  f°°  kd~ldk 


/  POO 

,J2tx/L 


kA 


L 


Vi  k4  +  z^k2 


Letting  £  =  kL  :  2ix  — »  oo  we  obtain 


For  d  =  2  the  integrals  become 


f<"«-  i'  I, 


°°  \  1 
27r  £4+Pe2£2/dJ  • 


logPe 

2.  £4  +  PeH2ji  ~  Pe2 


resulting  in  the  asymptotic  bound 


In  d=3 


M0< 


Pe 


%/log  Pe 


/;<-•«= h  l 

resulting  in  the  asymptotic  bound 


d( 


2;r  £4  +  Pe2<£2/2  Pe 
Mo  <  \fP~e. 


(3.21) 

(3.22) 

(3.23) 

(3.24) 

(3.25) 

(3.26) 


An  efficiency  scaling  of  Pe  leads  to  an  eddy  diffusivity  proportional  to  UL  from  (3.2).  For  d  =  3 
there  is  a  dramatic  modification  to  the  scaling  that  implies  the  eddy  diffusivity  is  proportional 
to  y/R. 


3.2  Bounds  on  the  gradient  variance 

Beginning  with  the  first  step  in  the  TDG  procedure 

(0(u  *  V  +  K.A)(p)  =  —  (ips)  (3.27) 


we  integrate  by  parts  and  apply  the  Cauchy-Schwarz  inequality  to  obtain 

(<ps)2  =  ((u<p  +  kV<p)  •  V0)2  <  (|u<p  +  kV<p|2)(|V0|2).  (3.28) 


The  sharpness  of  this  bound  is  discussed  at  the  end  of  this  section.  Continuing  as  usual,  we 
construct  a  variational  principle  to  obtain  a  lower  bound  on  the  gradient  variance 


(\V9\2)  >  max 


(ps)2 

(|u<p  +  mV<p|2)’ 


(3.29) 


Thus  we  minimize  the  denominator  subject  to  the  constraint  of  <p  having  a  unit  projection  on 
the  source.  Under  the  homogeneity  and  isotropy  assumptions  of  HIT  the  variational  problem 
becomes  one  of  evaluating 


min  {(k|V</?|2  +  C/2<p2)  \  (ips)  =  1}. 
v 


(3.30) 
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We  want  to  minimize  the  functional  T  (^(k|V(^|2  +  U2<p2)  —  //( s<p  —  1)).  The  solution  of  the 
optimization  problem  is  (after  some  algebra) 

<|V0|2)  >  (sMis)  (3.31) 


where  M\  =  (— k2A  +  Z72)"1. 

A  sharp  lower  bound  on  the  gradient  variance  is  easily  proven.  Upon  taking  the  inner  product 
of  6  with  the  advection-diffusion  equation  we  obtain  the  variance  dissipation  constraint 

K(|W|2)  =  (s6).  (3.32) 

Inserting  V_:IV  =  1  on  the  right  hand  side,  integrating  by  parts  and  applying  the  Cauchy- 
Schwarz  inequality,  we  deduce 

k2( \V6\2)  <  (IV-^I2)  =  k2(|V0o|2>  =>  Mi  >  1.  (3.33) 

Hence  stirring  always  reduces  the  gradient  variance  which  wras  not  proven  for  the  variance 
(previous  section). 

Given  the  upper  and  lower  bounds  on  the  gradient  variance  we  can  bound  the  small  scale 
mixing  efficiency  according  to 


1  <  M? 


<  1  (iv-1*!2) 

~  K2 


In  Fourier  space  this  is 


1  <  M2  < 


v r  k2+£ 


These  bounds  only  make  sense  when  the  sums  on  the  right  hand  side  converge. 
Note  that  if  — ►  0  and  if  $(x)  £  L2  then 


(lV-]6i2)U2  _  U2£2 
(s2)  K2  K2 


(3.34) 


(3.35) 


(3.36) 


where  £s  =  (|V~15|2)1/2/(s2)1//2.  So  if  s  6  L2  then  M\  <  Pes,  but  if  s  £  L2  then  Mi  =  1  i.e. 
there  is  only  suppression  of  gradient  variance  if  U(\ V~1$\2)1/2/k,(s2)1/2  1. 

Here  we  re-examine  our  application  of  the  Cauchy-Schwarz  inequality  which  was  the  first  step 
when  deriving  an  upper  bound  on  the  gradient  variance.  In  fact  the  analysis  can  be  improved  a 
bit.  We  expect  the  bound  to  involve  only  the  curl-free  part  of  the  field  u <p+  kV(/x  This  can  be 
seen  by  first  evaluating 


min  {(|V0|2)  |  (<ps)  =  {{u<p  +  KV<p)  ■  S/9)}  (3.37) 

6 


with  functional  T  (^|V0|2  +  A(v  ■  V#  —  ips))  where  v  =  uip  +  kV</?.  The  solution  to  the 
variational  problem  is  (after  some  algebra): 


<|V6»|2) 


_ (V’fi)2 _  >  (<Fg)2 

((V-v)(-A-i)V-v>  -  (|v|2)‘ 


(3.38) 
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The  inequality  follows  immediately  from  examining  the  Fourier  representation  of  the  denomina¬ 
tor.  Note  that  we  can  decompose  v  as  follows 

v  =  v-VA-1V-v+.VA-1V-y.  (3.39) 

v - v - '  N - V - ' 

divergence  —  free  curl  —  free 


The  denominator  above  is 

((V  •  v(-A_1)(V  •  v))  =  ([AA-1(V  •  v)](— A-1)(V  •  v)) 

=  (V(A-1(V  •  v)  •  V(A_1(V  •  v)) 

=  (IVA-^-vI2).  (3.40) 

From  this  it  is  clear  that  the  explicit  optimization  over  6  yields  a  sharper  bound  by  picking  out 
the  component  of  v  which  is  curl-free. 

Interestingly,  this  new  bound  depends  on  the  two  point  correlation  and  involves  a  non-local 
integral  operator  i.e.  for  d  =  2 

((V  •  v)(— A-1)V  •  v)  =  T  J dx  J  dyVx  •  v(x)G(x  -  y) V,  •  v(y)  (3.41) 

where  G(x  —  y)  is  the  Green’s  function  of  —A.  After  integrating  by  parts 

((V  •  v)(— A-1)V  *  v)  =  ^  J dxj dy(-VVG)  :  vv  (3.42) 

where  under  the  homogeneity  assumption, 

vv  =  <p(x)<p(y)u(x)u(y)  +  K2V<p(x)V<p(  y).  (3.43) 

It  is  the  first  term  that  prevents  the  expression  from  collapsing  to  |v|2  (the  second  term 
becomes  K?(A(p)2  after  integrating  by  parts).  The  first  term  depends  on  the  two-point  correlation 
of  the  velocity  field.  Under  the  assumptions  of  HIT,  the  velocity  field  has  single-point  statistical 
properties  and  hence  the  first  term  collapses  to  U2(p2/d  which  implies  that  for  HIT  a  strict 
application  of  Cauchy- Schwarz  (without  minimizing  over  6)  yields  a  sharp  bound.  In  turbulence 
theory,  the  two-point  correlation  for  Homogeneous  Isotropic  Turbulence  is  written  as 

Ui(x,  -)uj( y,  •)  =  6ij9( |x  -  y|)  +  —  |^yj2  ~(f  ~  9)-  (3.44) 

Incompressibility  implies  that  g(r)  =  f(r)  +  rf(r)/(d  —  1).  This  new  bound  introduces  de¬ 
pendence  on  the  two-point  correlation  property  of  the  velocity  field.  The  implication  of  such 
two-point  statistical  properties  on  the  scaling  of  the  bound  will  be  the  subject  of  future  inves¬ 
tigation.  We  note  that  the  Cauchy-Schwarz  bound  cannot  be  improved  for  both  the  variance, 
because  it  is  a  scalar  field,  and  the  inverse  gradient  variance  (next  section)  because  it  manifestly 
involves  a  curl-free  field.  We  will  revisit  the  implications  of  the  two-point  statistical  properties 
when  we  examine  the  bounds  including  scalar  decay  (section  6). 

3.2.1  Delta  function  source 

It  is  clear  that  a  ^-function  or  white  noise  source  (|s(k)|  ~  1)  will  cause  the  sums  in  (3.37)  to 
diverge  in  both  d  =  2  and  3.  Thus,  in  the  case  of  (5-like  sources  or  sinks  the  mixing  efficiency 
bound  is  sharp  and  equal  to  1. 
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3.3  Bounds  on  the  inverse  gradient  variance 

Beginning  again  with  the  first  step  of  the  TDG  procedure 


(<ps)  =  —  ((u  •  Vy?  +  h:A(p)6) 

=  (V(u  •  Vy?  +  K.Aip)  •  VA“~]0) 

<  <|V(u*  V^  +  K,A¥))|2)^(|V"10|2)i  (3.45) 


Continuing  as  usual,  we  construct  a  variational  principle  to  obtain  a  lower  bound  on  the  inverse 
gradient  variance 


<|V  ]#|2)>max 


(ys)2 

(|V(u  •  S/f  +  kAv?|2) 


(3.46) 


Thus  we  minimize  the  denominator  subject  to  the  constraint  of  y?  having  a  unit  projection  on 
the  source,  i.e.  we  evaluate 


min  {(|(Vu)  •  Vy>  +  u  *  Wy?  +  K;VAy?|2)  |  ( y>s )  =  1}.  (3.47) 


The  assumptions  of  HIT  simplify  the  problem: 


|(Vu)  •  Vy>  +  u  *  VVy>  +  ttVAy>|2  =  (Vy^;)  •  uu  •  (Vy?^)  +  Vy>  •  [(Vu)tr(Vu)]  •  Vy>  +  K2|AVy>|2 

=  k2|AV(^|2  +  (r2/d)|V</>|2  +  (U2/d)(A<p)2.  (3.48) 

Thus  the  variational  problem  reduces  to  evaluating 

min  {(k2|VA<p|2  +  (T2  /  d.)\V  <p\2  +  (U2/d){ A<p)2)  |  (<ps)  =  1}  .  (3.49) 

The  solution  of  the  variational  problem  is  (after  some  algebra) 

(|V_10|2>  =  (sM-is)  (3.50) 

A4_i  :=  (k2A3  —  (F2/d)A  +  (U2/d)A2)~* .  A  lower  bound  on  the  inverse  gradient  variance  is 
obtained  from  simple  application  of  the  Cauchy- Schwarz  and  Poincare  inequalities  to  the  bulk 
variance  dissipation  constraint: 

£  »<|vs'2>-  <35i) 

Because  it  uses  the  Poincare  inequality  the  lower  bound  is  only  sharp  if  the  source  is  monochro¬ 
matic  at  the  lowest  wavenumber,  2n/L. 

Given  the  upper  and  lower  bound  on  the  inverse  gradient  variance  and  the  value  in  the 
absence  of  stirring,  (|V_1  A-1s|2),  we  obtain  bounds  on  the  mixing  efficiency  on  large  scales 


256tt8{V-1A-1s|2)  (1  V~ 3  A~ 1g|2) 

L 8  (| V s|2)  “  -  (sM^s) 


(3.52) 


In  Fourier  space. 
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(3.53) 
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For  a  monochromatic  source  the  upper  bound  is 


M_j  <  1  + 


t/2 


+ 


r2 


dK2k2  dn2k4 


=  1  + 


l/2 

dn2k2 


( 


r2  \ 
+  t/2r2  / 


=  1  + 


Ve 2  /  1  \ 

d  V  +a2p; 


(3.54) 


where  we  define  Ve  =  U / nk .  Note  that  the  efficiency  depends  on  the  shear  in  the  flow  directly 
through  T.  Interestingly,  T  allows  for  an  increase  in  the  mixing  efficiency  on  large  scales  via 
stirring  on  small  scales  (coupling  the  different  scales).  We  investigate  this  potential  effect  for  a 
steady  shearing  wind  in  section  5. 

3.3.1  Delta  function  source 

Consider  a  ^-function  source  (measure  valued).  Suppose  k  — >  0  while  U 2  and  T2  are  fixed  then 
the  asymptotic  form  of  the  upper  bound  is 

o  ^  f  f°°  fc^dfcA  (  r°°  kd_1dk  ^ 

MU  < 


(  r°°  k^dk\  (  r 

\J2n/L  J  yj2. 


W/L  ke  +  ^k4  +  ^1k2/ 
Zd-*dZ 


Letting  £  =  kL/2n  :  1  — >  oo  we  obtain 

In  d=2,  letting  rj  =  £2 

rw=i  r _ _ =i  / 

7i  4  4-  A  ti  +  pfy  +  ge?  2  J, 


-1 


c4  .  ,  r2L4 

^  *  47 T2dK,2  ^  '  167r4dK2 


dr) 


re"  <72  j  r  re 
d  S  A2  d 

where  a  =  Pe2 /d  and  (3  —  L2Pe2 /A2d.  In  the  limit  Pe  — »  oo 


??(772  +  ar/  +  0) 5 


i; 


c1^ 


_  i  rc 

^~2  J, 


dr\ 


ij(ai)  +  P) 


C  +  Tf’  +  VV 

After  some  algebra  we  find 

r _ _ ^J_inL  +  ^ 

Ji  ^4  +  Efe+^Ef  W  \  +<*J 

note  that  /?/a  =  L2/ A2.  The  efficiency  bound  becomes  (as  Pe  — >  oo) 


Mi,  <  |- 


PeJ 


,L2 


In 


A2 


In 


(!  +  @) 


Interestingly,  the  prefactor  can  be  larger  for  smaller  scale  flow. 
In  d  =  3 


roc  i  rc 

/  / 


d( 


f4+^e+(#)2^ 

after  a  change  of  variables  77  =  ^  >  oo 

^  „  1  A  r0  dr? 
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Pe(?  +  g) 


r  00 

l  ¥TW 
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hence 


e+^e+(if)2^  Pei 

Ml,  <  - 1 - TV- 

A  (|  —  arctan 


1  A  /  7T  A 

„  =  -=r-  —  —  —  arctan  — 

A7?2  +  l  Pe  L  \  2  L 


(3.55) 


(3.56) 


(3.57) 


(3.58) 


(3.59) 


(3.60) 


(3.61) 


(3.62) 


(3.63) 
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4  Saturating  the  multiscale  mixing  efficiency  bounds 


The  HIT  bounds  on  the  multiscale  mixing  efficiencies  derived  in  the  previous  section  simplify  in 
the  case  of  a  monochromatic  source 


Mi  <  y/l  +  Pe2/k l 

(4.1a) 

M0  <  Vl  +  Pe2/k2d, , 

(4.1b) 

M_i  <  v/l  +  Pe2jk2d  +  L2Pe2/\2k4d 

(4.1c) 

where  we  have  rescaled  [0,L]rf  to  [0,  l]d  so  that  k8  is  a  multiple  of  27r.  We  note  that  each 
efficiency  scales  as  Pe  which  corresponds  to  replacing  the  molecular  diffusivity  by  an  eddy 
diffusivity  proportional  to  UL .  An  anomalous  scaling  results  if  the  efficiencies  are  not  linear 
in  Pe.  Figure  2  from  TDG  showed  the  mixing  efficiency  Mo  versus  Peclet  number  from  direct 
numerical  simulations  (DNS)  of  the  advection-diffusion  equation  for  a  monochromatic  source 
sin  ksx  and  the  Zeldovich  sine  flow  [3].  TDG  discussed  the  possibility  of  saturating  the  upper 
bound  for  the  sin  ksx  source;  however  it  was  clear  from  the  DNS  calculations  that  the  sine  flow 
was  not  the  optimal  stirrer  and  no  other  stirring  field  was  put  forth. 

Here  we  show  that  the  TDG  upper  bound  on  the  variance  for  s(x)  =  sinfcsxi  and  x  €  Td 
i.e.,  x  G  [0,  L]rf,  is  saturated  by  the  sweeping  flow  suggested  by  W.  R.  Young.  Consider  the 
steady  advection-diffusion  equation  with  source  s(x)  =  >/2S  sin  ksx ]  and  uniform  stirring  field 

u(x)  =  (£//Vrf)S:t.!- 


u  m 

yfd.  L.  dxn 

71=1 


~K^dxl 


+  y/2  S  sm(ksxi). 


(4.2) 


We  sweep  on  an  angle  for  a  long  time  (to  kill  the  transients)  and  then  switch  the  sweeping  by 
an  appropriate  angle  (repeating  appropriately)  as  to  satisfy  the  HIT  assumptions. 

Letting  0(x)  =  Yin- 1  Fn(xn)^  we  end  up  with  a  system  of  constant  coefficient  ODEs 


d2F ,  U  dF\  ,  y/2S  .  n  s  n 

~r~2 - 7TT~  + - sm  (kaxi)  =  0 

dx \  y/d.K  dx  1  K 


d~Fn 


U  dFn 


=  0 


for  2  <  n  <  d 


dxl  y/d,K  dx, 

with  periodic  boundary  conditions  Fn( 0)  =  Fn(L)  whose  solution  is 


F\  = 


V2SL 2 


(47t2k2  +  ^j-L2) 


k  s\n(ksXi)  — 


UL 

2\fdi\ 


cos(k$xj] 


Fn  =  0 


for  2  <  n  <  d. 


The  variance  is 


(02)  = 


s2l 4 

r2jTe  i 


K2  + 


(47T2K2  +  ^L2)2  L 

and  since  (6q)  =  S2 L4 / Km4 k2  the  mixing  efficiency  is 


UH? 

47 x2d 


U2L2  I  Pe2 

MO  —  \  1  +  “ — n  ,  n  —  A  / 1  + 


4i\2d.K2 


4n2d 


(4.3a) 

(4.3b) 


(4.4a) 

(4.4b) 


(4.5) 


(4.6) 
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which  is  the  bound  derived  by  TDG.  Given  the  steady  solution  we  can  compute  the  other  two 
mixing  efficiencies.  After  some  straight-forward  algebra  we  find 

Mi  -  M_i  = 

Interestingly,  Mo  and  M_i  are  precisely  saturated  but  M\  is  off  by  a  d  factor.  Since  the  uniform 
flow  lacks  shear  the  large  scale  mixing  efficiency,  M_i,  is  identical  to  the  others. 

The  result  is  fairly  intuitive:  to  reduce  the  variance  (on  any  length  scale)  one  should  simply 
blow  the  source  onto  the  sink  and  vice  versa  —  if  one  can  do  this  simply.  We  note  that  this  type 
of  sweeping  flow  is  somewhat  pathological  in  the  sense  that  it  simply  transports  the  source  onto 
the  sink  which  can  be  done  simply  on  the  torus.  There  is  no  dependence  on  diffusion.  However, 
such  sweeping  flows  are  not  allowed  on  the  sphere  or  in  bounded  domains.  Furthermore,  the 
sweeping  flow  is  not  optimal  for  non-ID  sources.  This  makes  it  clear  that  the  optimal  stirrer  is 
a  function  of  both  the  source  shape  and  the  domain.  Formulating  the  optimization  problem  for 
the  optimal  stirring  field  is  a  subject  of  current  and  future  investigation.  It  is  a  nasty  non-linear 
problem. 

To  emphasize  the  relationship  between  the  source  and  the  stirring  field  which  saturates  the 
bound  we  perform  an  analogous  calculation  to  the  previous  one  (for  d  =  2,3)  however  we  impose 
a  5-function  source  distribution.  Taking  the  Fourier  transform  of  the  steady  ad vect ion-diffusion 
equation  with  s  —  5(x)  we  obtain 

=  («) 

k  k 

where  kd  is  the  dth  component  of  the  horizontal  wavenumber.  Approximating  the  integrals  by 
sums  (we  are  only  interested  in  the  asymptotic  behaviour) 


d  =  2: 

(|V^|2)  =  j 

r2ir 

1  “*) 

r°°  k2p+1dk 

2-n/L  K?kA  +  U2 k'1  cos2  6 

(4.9a) 

d  =  3  : 

<|Vp0|2>  =  j 

1  d<p  I 

0  J I 

r  .  ...  f°°  k2^+1Uk 

0  smddeL/LK2kA  +  u2k2cos2e • 

(4.9b) 

The  variances  in  the  absence  of  stirring  are  found  by  calculating  the  above  integrals  with  U  —  0. 
Straight-forward  evaluation  of  the  integrals  yields 


d  =  2: 
d  =  3: 


Mi-1, 

Mi-1, 


Mq  ~ 


y/Pe 
47 r  ’ 


M—i  ~ 


3 VPe 
87 r 


2 y/Pe 
y/3n 


(4.10) 

(4.11) 


The  anomalous  scaling  in  Pe  suggests  that  the  uniform  flow  is  far  from  the  optimal  allowed  by 
the  bound  for  the  5-function  source  in  both  d,  —  2  and  3.  This  emphasizes  the  source-dependent 
nature  of  the  optimal  stirrer. 

Given  that  the  the  optimal  HIT  stirrer  for  5  —  sin  ksx  was  at  an  angle,  the  calculations  from 
TDG  were  repeated  for  a  tilted  source  and  the  Zeldovich  sine  flow  to  see  if  we  could  get  closer 
to  the  bound.  Tilting  the  source  is  equivalent  to  tilting  the  stirring.  Figure  1  shows  the  results 
of  the  DNS  calculation  for  p  —  1,0,  —1.  The  plot  of  Mq  includes  the  PY  bound.  What  is  clear 
from  this  figure  is  that  for  a  non-optimal  flow  the  three  bounds  scale  differently  in  Pe.  In  the 
next  section  we  investigate  the  bounds  for  a  simple  steady  shear  flow  in  an  effort  to  understand 
the  scaling  in  Pe  as  well  as  to  explore  the  dependence  of  M_  1  on  the  Taylor  microscale. 
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Figure  1:  From  left  to  right,  mixing  efficiencies  Mp  p  =  1,0,  —1  for  the  Zeldovich  sine  flow  with 
source  sin  k(x  +  y).  The  solid  lines  are  the  respective  upper  bounds  from  section  3.  The  dotted 
line  is  the  PY  variance  bound  and  the  dashed  lines  are  the  result  of  direct  numerical  simulations 
with  U  fixed. 


5  Steady  Shear  Flows 

From  the  previous  section  it  is  clear  that  the  most  efficient  stirring  for  monochromatic  sources 
on  the  torus  is  the  sweeping  flow.  Here  we  investigate  the  effect  of  shear  on  the  mixing  efficiency 
on  different  scales  to  understand  the  scaling  behaviour  at  large  Pe  (i.e.  the  scaling  when  we  do 
not  saturate  the  bound).  The  results  would  directly  apply  to  HIT  stirring  analogous  to  blowing 
for  a  very  long  time  (the  sine  flow  having  a  long  period).  From  the  analysis  of  section  3  we 
would  expect  to  see  a  large  difference  between  the  three  norms  for  such  sheared  stirring.  We 
treat  the  simplest  problem  by  considering  the  long  time  behaviour  of  a  passive  tracer  goverened 
by  the  advection-diffusion  equation 

u  -  V0  =  kA 0  +  s(x)  (5.1) 

with  a  stirring  field  u  —  y/2 U  smkuyi  and  source  s(x)  —  V2Ssinksx  (\/2  for  normalization). 
Here  the  domain  is  the  2-dimensional  torus  x  G  T2.  The  non-dimensional  number  governing  the 
amount  of  shear  is  r  =  ku/ks.  We  are  particularly  interested  in  the  limits  Pe  1  with  r  fixed 
and  r  >  1  with  Pe  fixed.  The  solution  takes  the  form 

0(x)  =  f{y)  sin (ksx)  +  g(y)  cos {ksx)  (5.2) 

which  results  in  a  system  of  ODEs: 

-V2Ukssm(kuy)g(y)  =  k  -k*  +  ~  f(y)  +  y/2 S  (5.3a) 

y/2Ukssm(kuy)f{y)  =  k  -k2s  +  A  div)-  (5.3b) 

The  stirring  field  is  an  odd  function  of  y  and  hence  from  (5.4a)  we  deduce  that  g(y)  is  also 
odd  in  y  and  hence  that  f(y)  is  even  in  y.  This  can  also  be  seen  by  integrating  (5.4a)  over  a 
period.  Since  the  functions  f(y)  and  g(y)  are  periodic  we  consider  the  domain  y  G  [0, 1/2]  where 
/  =  2n/p.  We  infer  boundary  conditions  given  the  even-oddness  of  the  functions  /  and  g: 

g( 0)  =  9(1/ 2)  =  0,  /'(0)  =  /'(//: 2)  =  0.  (5.4) 
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Upon  setting  y  =  ksy,  f  =  fUks/S ,  g  =  gUks/S,  r  =  ku/ks,  and  Ve  =  \Z2U/nka  we  obtain  the 
non-dimensional  ODEs 

/(£)  +  1  =  -  sin (ry)g(y)  (5.5a) 

g(y )  =  sin(ry)/(j/).  (5.5b) 

The  next  sections  outline  the  boundary  layer  analysis  (since  the  solution  is  slowly  varying  except 
in  isolated  boundary  layers)  and  regular  perturbation  theory  which  were  used  to  investigate  the 
limits  mentioned  above. 


1 

dz 

Ve 

l  +  dy2_ 

1 

1 - 1 

+ 

Y— 1 

1 

1 _ 1 

Ve 

5.1  Boundary  layer  solution 


This  is  the  limit  Ve  1  and  r  fixed.  Proceeding  as  usual,  we  construct  an  inner  and  outer 
solution.  The  outer  solution  is  obtained  by  expanding  in  powers  of  Ve~l : 

oo  oo 

font  =  5]  Ve~nfn,  gout  =  Y, V e~n9n •  (5-6) 

n=0  n= 0 


Thus,  in  the  outer  region  the  solution  is  approximated  to  leading  order  by 


fout  —  0,  9  out  — 


sin(/cwy)  ‘ 


(5.7) 


The  boundary  layer  scaling  was  determined  from  a  dominant  balance  argument.  The  left  hand 
side  of  (5.6a)  is  1),  V  e  thus  we  choose  e  =  Ve~ 1/3  and  rescale  y:  y  —  y/e  to  achieve  a 
self-consistent  scaling  of  the  leading  order  terms.  Expanding  in  e  according  to 


fin  ~~  ^  ^  e  /n?  9in  —  ^  ^ 


£n9n 


n=— 1 


n—  —  1 


(5.8) 


(note  that  the  leading  term  is  1/e))  yields  at  order  0(  1/e): 

+  rrig- 1  +  1  =  0,  -  rgg-i  =  0.  (5.9) 

Letting  £  =  r1/3??,  F  =  r2/3/_i,  and  G  =  -r2/3#-!  this  simplifies  the  system  of  ODEs  to 

F"  +  £G  +  1  =  0,  G"  -  =  0.  (5.10) 

with  boundary  conditions 

JP'(O)  =  0,  G(0)  =  0.  (5.11") 

The  other  boundary  conditions  come  from  the  requirement  of  matching  to  the  outer  solution: 
F( £)  0  and  G(£)  — >  — 1/£  as  £  — >  oo. 

We  note  that  this  system  of  ODEs  can  be  cast  into  the  Airy  equation  with  a  complex 
argument  <f>(z)  =  F  +  iG  but  we  resorted  instead  to  shooting  to  get  the  solution  numerically. 
The  solution  was  obtained  by  shooting  backward  (which  was  the  more  stable  direction)  from 
the  £  — »  oo  solution  whose  asymptotic  behavior  may  be  deduced  from  (5.10) 

F~-£4+^0’  +  (5‘12) 
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Figure  2:  Comparison  of  the  direct  numerical  solution  (solid)  and  the  boundary  layer  solution 
(dashed)  for  Pe  =  1000. 


where  q  and  (3  were  adjusted  numerically  to  obtain  a  solution  which  satisfied  the  boundary 
conditions  at  £  =  0.  Before  calculating  higher  order  terms,  we  compared  the  boundary  layer 
solution  against  the  solution  from  a  direct  numerical  simulation  of  the  advection-diffusion  equa¬ 
tion.  Figure  2  shows  a  plot  of  the  two  solutions  for  Pe  =  1000.  The  agreement  suggests  that 
leading  terms  do  indeed  capture  the  asymptotic  behaviour.  The  inner  solution  is  thus  well 
described  by 


r-2/3  r-2/3 

fin  =  - F( 0,  9in  -  - G{ 0. 


(5.13) 


The  final  approximate  solution  to  the  coupled  ODEs  to  leading  order  is  the  composite  of  the 
inner  and  outer  solutions  (recovering  all  the  scalings  and  letting  6  —  c/rl^ks) 
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9{y)  =  WskuV  9in  9out  =  Uks~7~G  \ 

S  l:G(l 


kuV 


sin  (kuy) 


foul/ 


(5.14a) 


Uks  kuS  \SJ  sin (kuy)  ’ 

Armed  with  this  we  can  compute  the  multiscale  mixing  measures  (|VP0|2)  for  p  =  0, 1,  —1. 


(5.14b) 


5.1.1  Variance 


The  variance  is  (N.B.  only  computing  over  1/4  period) 

(O2)  =  1  «/2>  +  <92})  =  \\  (jT s\v)iv  +  jT 92(s/)<%) 
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(5.15) 


(5.16) 
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Figure  3:  Mixing  efficiencies  for  p  =  1  denoted  by  x,  p  =  0  denoted  by  *,  and  p  =  —  1  denoted 
by  +  for  the  steady  shearing  flow  with  r  =  1  from  direct  numerical  simulations  with  U  fixed. 
The  solid  lines  are  the  asymptotic  scalings. 


Here  we  are  interested  in  the  scaling  as  6  — >  0.  In  that  case  we  are  justified  in  replacing  the 
upper  limit  of  the  integral  of  F2(rj)  by  infinity  since  the  outer  solution  is  zero.  More  care  must 
be  taken  with  the  integral  involving  G2{rj).  We  note  that  k2rj2S 2  j  sin 2(kur]S)  is  bounded  by  0 
and  7r/2  and  hence  we  can  apply  Lebesgue’s  dominated  convergence  theorem  to  obtain 

<#2> K  •  <5-17> 


Recall  from  section  2  that  (6q)  =  52//c2/cf  and  hence  the  mixing  efficiency  is 


M0  -  Cr1/sVeb/6, 


C  = 


Fk_ _ 1 _ 

]/Jo°  F2(v)dv  +  JT  G2{rj)dr] 


(5.18) 


Figure  3  shows  Mo  as  a  function  of  Pe  from  direct  numerical  simulations.  The  scaling  fits 
Pe5/6.  There  is  only  a  5%  difference  between  the  prefactor  calculated  from  the  boundary  layer 
solution  and  that  calculated  from  the  direct  numerical  simulation  for  Pe  =  1000  (after  rescaling 
Pe  to  Pe).  Remarkably,  the  Pe5/6  scaling  is  also  observed  for  the  HIT  stirring  (figure  1).  The 
scaling  in  r  was  also  confirmed.  The  Pe5/6  scaling  would  hold  for  different  values  of  r  with  a 
corresponding  change  in  the  value  of  the  prefactor. 


5.1.2  Gradient  variance 


The  gradient  is 

<|V9|2)  =  y  [</2>  +  (92)]  +  \  [«/')2>  +  «9')2)]  (5.19) 

noting  that  the  first  term  was  computed  in  the  previous  section  we  focus  attention  on  the  second 
term.  Computing  the  gradient  we  obtain 
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(5.20) 
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The  leading  order  contribution  to  the  integral  is  the  square  of  the  first  term  (1  /<52  versus  1/5). 
Hence 


\  [<(/')2>  +  (to')2)] 


1  S2  1 
7T  iPk2  kJP 


{F')2{r])dT]  + 


(G')2(v) 


kffiS2 

sin2  (kurjS) 


dr i 


(5.21) 


and  by  the  same  arguments  as  the  previous  section  (DCT  etc.)  we  obtain 
<|W|2)  K + 

Recall  from  section  2  that  (|V0o|2)  —  52/k2/c2  and  hence  the  mixing  efficiency  is 

Mj  w  CVe1'2,  C  =  Jl—=====J====, 

V  J /0°° {F')Hv)dTi  +  /0°° WfijMn 


(5.22) 


(5.23) 


The  boundary  layer  and  direct  numerical  solution  prefactors  differ  by  approximately  1%  for 
Pe  =  1000.  Figure  3  shows  the  scaling  of  M\  from  the  direct  numerical  solution  that  confirms 
the  Pe1/2  scaling.  The  scaling  in  r  was  also  confirmed.  Interestingly,  stirring  at  small  scales 
docs  not  enhance  the  mixing  efficiency  on  small  scales.  This  is  because  the  decrease  in  gradient 
variance  due  to  stirring  on  small  scales  is  compensated  by  the  increase  in  gradient  variance  in 
the  boundary  layer. 


5.1.3  Inverse  gradient  variance 

The  inverse  gradient  variance  is 

(|V_16>|2)  =  (|V-1(/(y)  sin(fcsx)  +  g(y)  cos(kax))\2)  (5.24) 


This  is  the  trickiest  of  the  three  multiscale  mixing  measures.  We  can  simplify  the  integral  by 
noting  that  the  leading  Fourier  component  of  g(y)  is  zero.  Expanding  f(y)  in  a  Fourier  series 


f(y)  =  fn  cos(nkuy) 


71=0 


we  obtain 


(5.25) 


V  1  (f(y)  sin(A:s.x))  =  ^ 2~ 2  fn  sin (nkuy)  sin{ksx)i  + 


nk2  +  n2k2  ’ 

71=0  *  V 

nk-u 


k2  +  n2kl 


fn  sin {nkuy)  cos(fcsx)j 


(5.26) 


and 


(|v  1(/(y)sin(fcsx))|2)  = 


I  fn 


1  ^  \fn 


n=0  ks  +  n2kl  ki 


~  l./o|2  +  T2 


hu  ^  n2  +  r<1 

u  n= 1 


1  7T 


-  +  ^2  XI  ~T  -  ^2  l/cl2  +  ^2  g  supl/nl^.  (5-27) 


k2  ^  n2  -fcf 
“  n=l  4 
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Pe1/6 


Figure  4:  M_i/Pe5/6  versus  Pe1//6  from  direct  numerical  simulations  (*).  The  solid  line  is  the 
asymptotic  scaling. 

Computing  the  first  Fourier  coefficient 

1  p/4  i  Q  r°° 

l/o|  =  -  /  Mdy  =  -JJT  /  F(ij)dr,  (5.28) 

ft  Jo  ft  U  ks  J o 

and  recalling  from  section  2  that  (|  V_1#o|2)  =  S'2/ we  obtain  the  large  scale  mixing  efficiency 
scaling 

M_i  -  CVe  (5.29) 

where  C  is  a  prefactor  depending  on  the  integral  of  F  (square  of  the  mean  versus  the  mean  of 
the  square).  Figure  3  of  the  direct  numerical  solution  confirms  the  Pe  scaling.  The  boundary 
layer  and  direct  numerical  solution  prefactors  differ  by  15%. 

We  note  two  interesting  things;  first  to  leading  order  there  is  no  dependence  on  r  even  though 
we  might  have  expected  that  increased  shear  would  increase  mixing  on  large  scales  (see  section 
3).  Second  we  expect  that  the  next  order  term  would  be  proportional  to  Mq  i.e.  that 

M_ i  ~  Pe  +  r1/3Pe5/e.  (5.30) 

Figure  4  shows  a  plot  of  M_i/Pe5/6  versus  Pe1/6  from  direct  numerical  simulations.  A  very 
non-rigorous  check  of  the  scaling  involves  comparing  the  slope  of  the  line  in  figure  4  to  the 
prefactor  above.  The  slopes  differ  by  20%.  Note  in  that  plot  the  current  limit  of  large  Peclet 
and  fixed  r  requires  r  <  0.1  y/Pe  i.e.  r  <  3(5<1). 

There  are  still  some  things  we  do  not  understand.  Are  there  universal  scalings  in  Pel  Namely 
for  the  steady  flow  and  the  Zeldovich  sine  flow  we  get  the  same  scalings.  Do  these  scalings  appear 
for  other  flows  with  this  source? 


5.2  Regular  perturbation  expansion 

We  now  seek  the  behaviour  of  the  multiscale  mixing  efficiencies  for  r  >>  1,  Pe  fixed.  Going  back 
to  our  system  of  ODEs  and  setting  y  —  kuy  we  obtain 

dT 

dy2  J 


5  sin(y)/  = 


i 

—  Z2 


9 


-^sin(y)5  = 


i 


d 2 


r2  dy2 


/  + 


y/2  S  1 

rcfc2  r2 


(5.31a) 

(5.31b) 
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Now  we  perform  a  regular  perturbation  expansion  in  powers  of  r  2: 


f  =Y2fnr  2n,  g  =  J2gnr 


—2  n 


n= 0 


71=0 


At,  0(1)  we  obtain 

fo  =  0  ->  /o  =  const,  g'o  =  0  -►  g0  =  const 
continuing  as  usual  we  obtain  at  0(r-2) 


Ve sin(y)/o  =  -go  +  9u  ~  Vesm(y)g0  =  -f0  +  fx  + 


„  .  Vis 


(5.32) 


(5.33) 


(5.34) 


The  average  over  a  period  of  the  first  and  second  equations  implies  go  =  0  and  fo  =  v2 S/nkg. 
Thus,  /]  =  const,  and  g\  =  —(ViS/Kk%)Ve  sin (y)  +  const.  Continuing,  we  obtain  at  0(r  4): 


Ve sin(y)/j  =  -g}  +  g'V  -  Ve  sin(y)yi  =  -/i  +  f'f. 
The  average  over  a  period  of  the  first  and  second  equations  imply 

,  Vi  s  2  Vis„  .  ,_s 


27T  K,lzl 

Hence  to  leading  order  the  solution  is 


Vis  \(,  Ve2\  .  n  N  Pe  .  ; 

6  Vkf  [  V  ~  27rr^  y  sm(fcsx)  -  sm(y)  cos (ksx) 


(5.35) 


(5.36) 


(5.37) 


Given  the  asymptotic  solution  for  r  2>  1  at  fixed  Ve  we  compute  the  variance  as  (keeping  only 
the  leading  order  term  and  the  0(r~ 2)  term) 


<02 


S2 

K>kj 


1  - 


1  Ve 2 


7 r  r * 


which  implies 


Mq  ~  1  -j — 


1  Ve2 


7r  r 


2  • 


(5.38) 


(5.39) 


It  is  clear  from  this  result  that  stirring  on  ever  small  scales  (increased  shear)  ceases  to  suppress 
the  variance.  This  is  because  in  the  limit  r  1  the  flow  is  diffusion  dominated  hence  the  mixing 
efficiency  goes  to  one. 

The  gradient  to  leading  order  is 


V0 


Vis 


Kkl 


1  - 


1  TV 


7 r  rz 

Ve 
r2 


and  so  the  gradient  variance  is 


Ve 

kscos(ksx )  H — tv  sm(y)ksm(ksx) 


cos (y)  cos(ksx) 


S  f  IVe2 

j-  o 


1  + 


K.A-2 


7t  r‘ 


(5.40) 


(5.41) 
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which  implies  the  mixing  efficiency  is  identical  on  small  and  intermediate  scales  M\  —  Mo* 

The  inverse  gradient  to  leading  order  is 

__i  n  y/2 S  f  IVe2  k  m  ,  s  Ve  ,  ks  ,  * 

V  Ott——  1 - 0-70  sin  (ksx)  +  -o-  sin(y)yz - - 7  cos(fcsx) 

K,k~  i  7T  rl  k~  r 1  kj  +  pz 

Ve  p 

— 5- cos (y)jY~. — 2  cos(ksx)  j  (5.42) 

rz  kg  +  pz 

and  hence  the  inverse  gradient  variance  is 

*  d?  (*  -  ;£)  (5-43) 

and  hence  M_  1  =  Mo  (as  suspected).  Thus  in  the  limit  r  1  and  Ve  fixed  all  of  the  efficiencies 
are  the  same. 

Summarizing  the  different  limits,  we  find  that  the  norms  scale  differently  in  Pe  for  fixed  r 
and  the  scalings  were  confirmed  with  the  direct  numerical  solution.  However,  the  norms  behave 
similarly  for  fixed  Pe  and  r  1  where  the  flow  is  diffusion  dominated. 


6  Bounds  on  the  multiscale  mixing  efficiencies  with  scalar  decay 

Now  consider  the  advection-diffusion  equation  for  the  concentration  of  a  passive  scalar  0(x,t) 
which  has  a  slow  decay  rate  a  maintained  by  a  body  source  s(x)  with  spatial  mean  zero: 

—  +  u  *  V#  =  kA 6  +  s(x)  -  a6  (6.1) 

where  k  is  the  molecular  diffusivity.  The  decay  rate  may  have  various  interpretations  such 
as  the  decay  rate  due  to  chemical  kinetics,  or  radiative  relaxation  in  meteorology  (relevant 
for  the  sphere).  We  follow  the  procedure  of  TDG  which  was  used  to  derive  upper  bounds  on 
mixing  efficiency  for  a  =  0  to  derive  upper  bounds  on  (|VP#|2)  for  p  =  0,1,  —1  when  a  /  0. 
We  re-define  the  advection-diffusion  operator  and  its  formal  adjoint  to  include  the  slow  decay: 
Ca  :=  dt  +  u  •  V  —  kA  —  a  and  £«  :=  —  dt  —  u  *  V  +  kA  —  a .  Now  we  proceed  with  computing 
bounds  on  the  multiscale  mixing  efficiencies. 

6.1  Bounds  on  the  variance 

Following  the  TDG  procedure,  we  perform  the  following  optimization: 

{62)  >  maxmin{{02)  |  (0(u  •  V</?  +  nAp  —  cap))  =  —{(/9s)}  (6.2) 

$ 

upon  applying  the  Cauchy- Schwarz  inequality  and  maximizing  over  tp  we  obtain 

(6>2)  >  (s.Mffs)  (6.3) 

where  _Mq  :=  (£a£a)-1.  Substituting  the  definition  of  £a  and  restricting  to  HIT 

(sMqs)  =  (s{k2 A2  -  uu  :  VV  +  k(2Vu  •  VV  +  Vu  •  V)  -  cm  •  V  -  2a«A  +  a2}_1s) 

=  (s{k2A2  —  (U2/d)  A  —  2naA  +  a2}_1s).  (6.4) 
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The  variance  in  the  absence  of  stirring  is 


(#o)  =  (s{k2A2  —  2/cttA  +  a2} 


(6.5) 


and  hence  an  upper  bound  on  the  mixing  efficiency  is 

(s{k2A2  —  2koA  +  a2}~ls) 

”  (s{k2A2  —  (U2/d) A  —  2koA  +  g:2}-1s) 

In  Fourier  space  this  is 

(Ma) 2  <  (y'  l^(k)l2  ^  f y' _ lg(k)l2 _ 

{  oJ  -  yZ-,  K2k4  +  2 KQk2  +  a2  J  K2k4  +  ( U2/d  +  K.a)k2  +  a2 

Clearly,  the  upper  bound  depends  on  the  Fourier  transform  of  the  source  function  5.  In  the  next 
section  we  investigate  the  high-Pe  behaviour  of  Mq  for  a  measure  valued  source.  To  determine 
the  high-Pe  behaviour  we  approximate  the  sums  by  integrals  and  take  the  limit  of  infinite  volume 
noting  that  the  a  term  allows  the  integrals  to  converge: 


( m°\2  <  ( H  i*(k)i2fcrf~ldrifc  t  ( r  i mfk^k  \ 
0  ~  \Jo  k4  +  2nak2  +  o2 )  yJo  +  2na)k2  +  a2) 

Letting  £=  in  d-D  we  obtain: 


(Mo) 


< 


(  r °°  l^yf)! 2J^M\ 

\Jo  C  +  2£2  +  1  ) 


r  \ 

Jo  £4  +  (Pe2  +  2)^2  +  I ) 


(6.8) 


(6.9) 


where  Ve  \=  U£/k  =  U / y/Va  where  £  =  yfnj  a  is  the  diffusive  length  scale  i.e.  the  distance 
travelled  by  diffusion  before  decay. 


6.1.1  Delta  function  source 


Consider  a  delta  function  point  source  so  that  there  is  a  separation  of  scales  between  the  source 
and  the  stirring  field.  In  d.  =  2  we  obtain 


r°°  1  _ _  1 

Jo  e+2^2  +  l  2 '  Jo  £4  +  ( VeVe 2  +  2)£2  +  1  2  6  “  £2 

where  £]  and  £2  are  the  solutions  of  the  quartic  equation: 


(6.10) 


6,2  - 


*Ve 


1  +  AVe 


2 


( Ve  +  2)  t  Ve  (l  +  2Ve  2  -  TPe  4  +  ...) 

2 


In  the  large  Ve  limit  the  mixing  efficiency  is 


Mo 


Ve 


2  In  Ve 


(6.11) 
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Note  this  is  a  distinct  scaling  as  regards  k  compared  to  the  problem  with  a  =  0.  An  efficiency 
scaling  of  ~  Ve  implies  the  eddy  diffusivity  would  scale  as  y/R.  For  d  =  3  we  have 


r  e#  i  r  (eu) 

Jo  £4  +  2£2  +  1  4  ’  J0  £4  +  (fe2  +  2)^2  _|_  1  2  £i  —  £2 

In  this  limit  the  mixing  efficiency  is 

MS<\Vfe.  (6.13) 

Thus  for  d  =  3  we  also  have  an  anomalous  scaling  in  Ve  with  an  equivalent  diffusivity  propor¬ 
tional  to  ft1/4. 


6.2  Bounds  on  the  gradient  variance 

For  the  gradient  variance  the  optimization  problem  is 

<|W|2)  >  maxmin{{|V0|2)  |  (V0  *  (u ip  +  nV^p  —  a(V(A“V)))|2)  =  (6.14) 

v  e 

upon  applying  the  Cauchy-Schwarz  inequality  (we  will  revisit  this  approach  at  the  end  of  this 
section)  we  obtain 


<|W|2)  >  max  < 


_ M! _ 

(\kV<p  +  <pu  —  o(V(A~V))l2)  * 


Under  the  assumptions  of  HIT 


(6.15) 


(|kV<£>  +  u<p-  a(V(A~V))|2)  =  («2|V<^|2  +  |u|  V  +  a2|V(A“V)|2  ~  2«aV(A“V)  *  V^6.16) 


The  solution  to  the  optimization  problem  is  (after  some  algebra) 


<|V0|2)  >  (sM'fs) 


(6.17) 


where  A4“  :=  (— k2A  +  ^  +  2/ca  +  a2  A  *)  1.  Given  the  gradient  variance  in  the  absence  of 
stirring  we  obtain  a  bound  on  the  small  scale  mixing  effciency 


(Mi)2  < 


(s{—k2 A  +  2na-\-a2A  :}  ls) 
(s{— k2  A  +  +  2na  +  a^A-1}-1#) 


In  Fourier  space  the  bound  is  expressed  as 


(Mf)2  < 


f  ls(k)[2  \ 


fy  l^)l2  \ 


(6.18) 


(6.19) 


Once  again  since  we  are  interested  in  the  high-Pe  behaviour  we  approximate  the  sums  by 
integrals 


/  f°°  k2\s(\d)\2kd~'1  ddk  \  (  r°°  fc2|i(k)[2fcd~1ddfc  \  1 

\io  K2fc4  +  2Kak2  +  a2)  yj0  k2/c4  +  (^  +  2na)k2  +  a2 ) 


(6.20) 
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Letting  £  =  in  dr D  we  obtain: 


-l 


(Mcs 2<  /  rmv/f)i2£d  (  r ^iVW^di] 

{  l)~[Jo  ^  +  2^2  +  1  JU  £4  +  (£  +  2K2  +  lJ 


(6.21) 


Clearly  the  convergence  of  these  integrals  depends  on  the  property  of  the  source.  We  must  have 
for  a  Y  0,  s  €  . 

Here  we  re-examine  the  application  of  the  Cauchy- Schwarz  inequality.  The  analysis  can  be 
improved  by  evaluating 

min{(|V0|2)  |  (V0  ■  (up  +  -  a(V(A_Ip))))  =  <ps)}  (6.22) 

e 


with  functional  JF  :=  (5|V0|2  +  A(v- V0  —  <ps))  where  v  =  uip+KVip  —  Of(V(A“1</?)).  As  in  section 
3  the  solution  to  the  optimization  problem  depends  on  the  two-point  correlation  statistics  of  the 
velocity  field 


<|V0|2) 


(V*)2  >  (V*)2 

((V.v)(-A-i)V.v)  -  <|v|2)* 


(6.23) 


In  the  case  of  HIT  a  strict  application  of  the  Cauchv-Schwarz  inequality  yields  the  optimal 
bound.  Again  this  analysis  does  not  apply  to  either  the  variance  or  inverse  gradient  variance. 


6.2.1  Delta  function  source 

As  for  the  case  of  o  =  0,  a  (5-function  or  white  noise  source  will  cause  the  integrals  in  (6.21)  to 
diverge  and  thus  Mf  =  1  for  (5-function  sources. 


6.3  Bounds  on  the  inverse  gradient  variance 

For  the  inverse  gradient  variance  the  variational  problem  is 

(|V“]0|2)  >  maxmin{(|V“10|2)  |  (VA"1^  ■  V(u  ■  V</?  +  K,A<p  —  o^)|2)  =  (ips)}  (6.24) 
v  e 


upon  applying  the  Cauchy-Schwarz  inequality 


(|V-J0|J)  >  max 


Under  the  HIT  assumptions 


v  (| V(u  •  Vy?  +  kA<p  —  oup) |2) 


r2  u 2 

|2  i  1V7  „|2  ,  /  a,«\2  i  r7  \2 


(6.25) 


(|V(u  •  Vtp  +  xA(p  —  aip)\  )  =  (k|AV</?F  +  —  |V^p  +  —(Aip)2  +  or\ Vy?|  —  2 anVip  •  A Vy?). 
The  solution  to  the  optimization  problem  is  (after  some  algebra) 

<|V'^|2)  >  {sMljs) 


(6.26) 


where  A4°]  :=  (k2A3  —  (T2/d)A  +  (U2/d)A‘2  +  2koA2  —  o2A)  1 .  Given  the  inverse  gradient 
variance  in  the  absence  of  stirring  we  obtain  a  bound  on  the  large  scale  mixing  effciency 


(A/?)2  < 


(s{k? A3  +  2w.qA2  -  q2A}~1s) 
(s{k2A3  —  ^A  +  ^-A2  +  2kq'A2  —  a2A}-1s) 


(6.27) 
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In  Fourier  space  the  bound  is  expressed  as 

(Ma  )2  <  ('V'  lg(fc)l2  )  ( y' _ _ 

1  ~  \J  +  2 Kak4  +  a2k2J  ^  K2fc6  +  (Ul  +  2 «a)&4  +  (a2  +  r2)fc2 


-1 

.  (6.28) 


Once  again  since  we  are  interested  in  the  high  Ve  behaviour  we  approximate  the  sums  by 
integrals 


(i 


(M~l)  k2(K2k4  +  2nak2  +  a2) 

Letting  £  =  in  d— D  we  obtain: 

!*(£>/!) 


Hi 


/c2|s(fc)|2fcrf-1<id/c 


o  k2{n2k4  +  {-j-  +  2na)k2  +  a2  +  T2) 


(m: 


7 

Jo 


-1 


£2(£4  +  2^2  +  1)  )  o  ^  +  ( Ve  +  2)?  +  1  +  5) 


(.6.29) 


(6.30) 


In  d= 2  there  may  be  an  infra-red  divergence  problem.  The  integrals  converge  if  |s(k)|2  =  f (k) 
if  f(k )  ~  kP  where  (3  >  0  (we  require  mean  zero  sources  as  to  prevent  blow  up  at  0).  This  is  our 
only  restriction  on  the  source.  How  the  Fourier  transform  decays  as  k  — »  0  indicates  the  large 
scale  structure  of  the  source.  Exploration  of  the  bound’s  behavior  remains  a  task  for  the  future. 


6.3.1  Delta  function  sources 


In  d=  3 


f°°  d£  _  7 r  fc 

Jo  £4  +  2£2  +  1  4  ’  J0 


d( 


v  n/6  -  V& 


£4  +  2£2  +  l  4’  Jo  ^  +  (Ve+  2)£2  +  l  +  g  2^-6)766 
where  £1  and  £2  are  roots  of  the  quadratic  equation: 


6,2  =  (Ve  +2 )  ±  ^ 


4  -  2  4T2 

Ve  +AVe - 


a •* 


In  the  limit  of  Ve  >  1  we  get  to  leading  order  in  Ve: 

2 


.  2  -  2  L  r2/c2  1  Ve 

i,\—Ve  -  Ve  \  1  -  4 


9  /  F2/72 

w~m'  &K2Pe'  fc-^v1-4-^- 


hence 


r 

Jo 


dt 


7 r 


y/vPt 


0  ^  +  C  +  2)e2  +  l  +  5  2 


2Ve 


Ve 3 
Pe? 


(6.31) 


(6.32) 


(6.33) 


(6.34) 


where  Pe\  =  UX/k  where  A  (a  Peclet  number  using  a  length  scale  of  the  velocity  field).  Hence 

(M^)2  <  ^  (6-35) 

note  that  the  efficiency  may  be  larger  when  there  is  stronger  shear  T  >  1  as  was  found  in  the 
case  of  a  =  0.  Table  2  summarizes  the  high-Pe  scaling  for  HIT  in  the  case  of  a  =  0  and  a  /  0 
for  a  ^-function  source. 
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a  =  0 

II 

"*3 

d= 3 

a^0 

d= 2 

d= 3 

Mi 

l 

1 

M“ 

1 

1 

Mo 

Pe 

\/log  Pe 

•J¥i 

M0Q 

Pe 

y/\ogPe 

VTe 

M_i 

PeL 

\^/\og(\+L2/X2) 

Mf , 

* 

Pe 

Table  2.  High-Pe  scalings  of  the  multiscale  mixing  efficiencies  for  a  6-function  source.  The 
*  indicates  that  the  scaling  depends  on  |s(k)|  as  k  — »  0  and  Ve  U£/k  =  UjyfKoi.  where 
i  =  y/Kfa. 

7  Conclusions  and  future  work 

Multiscale  mixing  efficiencies  are  susceptible  to  rigorous  analysis.  Upper  (lower)  bounds  on  mul¬ 
tiscale  mixing  efficiencies  were  obtained  from  lower  (upper)  bounds  on  appropriately  weighted 
variances.  Bounds  on  large-scale  mixing  are  sensitive  to  small-scale  stirring.  The  bounds  can  be 
sharp  (sweeping  flows  on  the  torus).  Furthermore,  the  efficiency  of  some  complex  random  flows 
can  be  understood  via  simple  steady  state  scalings.  Finally,  the  inclusion  of  a  decay  term  in  the 
advection-diffusion  equation  introduces  new  features  namely  new  high-Pe  dependences  of  the 
equivalent  diffusivity  on  the  molecular  diffusivity. 

The  current  analysis  has  only  answered  some  of  the  questions  posed  in  the  introduction 
hence,  there  are  exciting  problems  that  are  the  subject  of  current  and  future  investigation.  For^ 
example,  extending  the  current  analysis  to  bounded  domains  (sphere  etc.)  and  formulating  an 
appropriately  constrained  variational  problem  for  the  optimal  (source  specific)  stirring  field. 
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1  Introduction 

The  behaviour  of  passive  scalar  tracers  moving  in  a  prescribed  velocity  field  can  be  described 
using  two  equivalent  formulations:  (i)  the  passive  scalar  equation  which  in  two  dimensions 
can  be  written  as 

dt6  +  VXH  ■  V0  =  eAO,  (1) 

(ii)  probabilistic:  description  in  terms  of  stochastic  differential  equations 

d.X,  =  -dyH(Xt  ,Yt)  +  V2e  d.w{x) ;  (2a) 

d.Yt  =  dxH{XuYt)  +  V 2e  d\vjy) .  (2b) 

Here  H y)  is  a  periodic  stream  function  defined  in  such  a  way  that  the  velocity  field  is 
given  by  u  =  (—dyH,  dxH).  In  what  follows  the  velocity  field  will  always  be  deterministic 
and  time-independent. 

The  subject  of  study  in  this  report  is  the  asymptotic  behaviour  of  the  process  (2)  in 
the  limit  e  —>  0.  In  fact,  this  question  can  be  analyzed  using  the  passive  scalar  equation 
(1).  Here  the  homogenization  technique  from  the  theory  of  partial  differential  equations  is 
applied  which  gives  a  description  of  the  behaviour  of  solutions  of  (1)  on  large  scales,  see 
[1].  Separating  slow  and  fast  variables  and  performing  a  multiscalc  analysis  one  obtains  the 
effective  diffusion  equation  for  the  evolution  of  9  on  large  scales 


dte  =  v  •  Dye,  (3) 

The  effective  diffusivitv  D*  is  a  constant  matrix  given  by  the  following  expression 

£>;(e)  =  <(eI  +  *)(Vx  +  e)-e),  (4) 

where  \  is  the  solution  of  the  so  called  cell  problem 

V-[(eI  +  *)(Vx  +  e)]  =  0.  (5) 

Note  that  equation  (5)  has  to  be  solved  in  the  domain  of  periodicty  of  the  streamfunction 
H (.7;,  y).  One  class  of  flows  for  which  the  cell  problem  (5)  is  amenable  to  analysis  is  a  special 
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case  of  the  well  known  ABC  flow,  namely  the  case  A  =  1,  C  =  0.  The  streamfunction  is  of 
the  form 

u  f  \  1  B  1  —  B  .  (  ^ 

H (rr,  y)  —  — - —  cos  x  cos  y  +  - — - —  sin  x  sm  y.  (o) 


The  case  B  =  1  has  been  treated  in  [2]  using  boundary  layer  techniques  where  it  was 
shown  that  D*  =  0(y/e ).  More  precisely,  the  boundary  layer  analysis  is  performed  in  a 
boundary  layer  of  width  0(y/e)  which  forms  at  the  boundaries  of  the  impermeable  cells 
of  the  underlying  velocity  field  \/±H(x^y).  Heuristically,  since  the  amount  of  the  tracer 
transported  along  the  boundary  layer  is  proportional  to  its  area,  the  effective  diffusivity 
has  to  be  proportional  to  y/e.  An  analytical  solution  of  the  cell  problem  in  the  boundary 
layer  approximation  was  given  in  [3]  using  Wiener-Hopf  technique.  In  the  case  B  <  1  the 
effective  diffusivity  is  anisotropic  and  as  has  been  shown  in  [4]  the  effective  diffusivity  across 
the  streamlines  is  of  order  0(e)  while  along  the  stream  lines  of  the  flow  it  is  of  order  0(  1/e). 

The  large-scale  picture  obtained  by  the  homogenization  approach  as  presented  above 
does  not  give  any  information  on  the  microscale  structure  of  the  diffusion  processes  hap¬ 
pening  in  the  flow.  However,  small  scales  do  play  a  major  role  in  the  form  of  boundary 
layers  which  is  typical  for  diffusion  processes  at  high  Peclet  numbers. 

In  principle,  stochastic  differential  equations  (2)  allow  us  to  describe  the  diffusion  of 
particles  in  much  more  detail.  But  how  much  information  can  we  obtain  on  the  limit  e  — >  0? 
A  standard  approximating  technique  in  this  framework  is  the  so  called  Wentzell-Freidlin 
method.  It  describes  the  behaviour  of  randomly  perturbed  Hamiltonian  systems  on  large 
time  scales  at  high  Peclet  numbers  by  means  of  continuous  diffusion  processes  on  graphs,  as 
is  explained  in  Section  2.  A  priori,  this  technique  works  only  only  for  Hamiltonians  H(x)y) 
such  that  H(x,y)  +oo  when  |(x,y)|  — >  oo  and  does  not  allow  for  existence  of  heteroclinic 
orbits. 

If  we  try  to  apply  this  technique  to  the  case  of  unbounded  cellular  flow  with  H (x,  y)  given 
by  (6)  we  arrive  at  a  paradox.  Namely,  the  transition  from  one  cell  to  another  will  happen 
instantaneously  no  matter  how  large  the  spatial  separation  between  the  cells.  Furthermore, 
the  characteristic  time  scale  of  diffusion  will  be  of  order  0(1 /e)  which  contradicts  the  results 
obtained  in  the  homogenization  framework  where  the  characteristic  time  scale  is  of  order 

0(1/Ve). 

However,  for  bounded  domains  this  technique  works  for  sufficiently  small  e  such  that 
1  / y/e  is  much  larger  that  the  domain  size  (therefore  it  is  not  in  contradiction  to  the  homog¬ 
enization  method).  This  gives  us  an  indication  that  the  Wentzell-Freidlin  method  remains 
locally  valid.  In  fact,  it  fails  on  unbounded  domains  because  of  its  global  structure  which 
is  determined  by  the  method  of  “gluing”  together  single  cells. 

Actually,  by  changing  the  “gluing”  prescription  between  the  processes  obtained  in  single 
cells  we  can  make  the  Wentzell-Freidlin  approach  to  be  consistent  with  the  results  given 
by  the  homogenization.  The  main  ingredient  here  is  the  conservation  of  probability:  the 
probability  current  going  out  of  a  cell  has  to  be  matched  with  the  probability  current  across 
the  cell  boundary  (dominated  by  the  boundary  layer  effects)  into  the  neighbouring  cells. 

The  structure  of  the  present  report  is  as  follows:  Section  2  is  entirely  devoted  to  the 
Wentzell-Freidlin  technique.  In  Section  2.1  we  explain  the  averaging  principle  for  a  single 
cell  and  its  deterministic  background.  Asymptotics  e  — »  0  of  solutions  to  (2)  on  bounded 
domains  is  described  in  Section  2.2.  In  Section  2.3  we  discuss  some  simple  models  for 
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diffusion  in  unbounded  cellular  flows.  Furthermore,  using  results  of  numerical  simulations 
we  argue  that  the  approximating  process  on  unbounded  domain  will  be  discontinuous.  In 
Section  3  we  outline  the  procedure  for  obtaining  this  process  and  state  the  result.  Possible 
applications  are  discussed  in  Section  4. 

2  Random  perturbation  of  Hamiltonian  systems 

For  small  values  of  e  (i.e.  for  high  Peclet  numbers)  equations  (2)  describe  small  random 
perturbation  of  the  determinstic  system 

x  =  -dyH{x(t),y{t)),  (7a) 

y  =  dxH{x{t),y(t))  (7b) 

which  represents  the  limiting  case  e  =  0.  Heuristically  one  would  expect  that  the  behaviour 
of  solutions  to  (2)  in  the  limit  e  ^  0  is  to  a  large  extent  determined  by  the  deterministic 
solutions,  i.e.  solutions  of  (7).  The  simplest  ansatz  of  this  type  consists  in  making  a  per¬ 
turbative  expansion  of  solutions  to  (2)  in  powers  of  around  the  deterministic  solutions1, 
see  [11,  12].  However,  this  approximation  is  of  little  use  when  we  want  to  study  long-time 
behaviour.  Indeed,  it  works  well  only  for  a  finite  period  of  time  (which  also  remains  true 
when  we  include  higher-order  terms)  and  does  not  take  into  account  the  separation  into 
fast  and  slow  variables.  The  latter  point  is  of  special  interest  to  us  because  the  underlying 
structure  of  the  deterministic  case  (to  be  described  in  Section  2.1.1)  is  at  the  basis  of  the 
approximating  method  discussed  in  Section  2.1. 

2.1  Slow-scale  motion  inside  a  cell 

2.1.1  Case  of  vanishing  viscosity 

In  terms  of  (1)  the  deterministic  case  corresponds  to  the  passive  scalar  equation  with  van¬ 
ishing  viscosity  e  =  0 

dte  +  X7xH  ■  V0  =  0  (8) 

and  an  initial  condition  6q.  The  charateristics  of  this  equation  are  given  by  (7)  so  that 
the  solutions  of  the  Cauchy  problem  for  equation  (8)  can  be  constructed  in  terms  of  the 
one-parametric:  flow  (by  taking  the  inverse  Lagrangian  mapping)  generated  by  the  velocity 
field  VxH(x,y),  see  e.g.  [5]. 

If  we  choose  the  streamfunction  (6),  then  equations  (7)  can  be  solved  explicitly,  see  [6]. 
For  the  particular  case  B  =  1  the  solution  is  given  in  terms  of  Jacobi  elliptic  functions  (see 

[7]) 

x(t,  h)  =  arcsin  ^\/l  — /i2sn(/,  Vl  -  h*))  ,  (9a) 

y(t,  h)  =  arcsin  (\/l  —  h2  cd (t,  y/l  —  /r2))  ,  (9b) 

Mn  the  ca.se  of  the  Hamiltonian  (G)  with  B  ~  1  the  explicit  solution  of  (7)  given  by  (9)  allows  us  to 

compute  analytically  the  first  order  term  in  the  expansion.  It  turns  out  to  be  a  stochastic  integral  the 

integrand  being  a  complicated  expression  involving  Jacobi  elliptic,  hyperbolic  and  logarithmic  functions. 


146 


where  x(0)  =  0  and  H(x ,  y)  =  h  is  kept  fixed  along  a  trajectory.  Obviously,  a  particle 
initially  located  in  one  particular  cell  will  stay  inside  this  cell.  The  period  of  motion  along 
a  level  line  with  H(x,y)  =  h  is  given  by  a  complete  elliptic  integral  4K(y/l  —  h 2).  For  the 
special  case  h  =  0  the  equation  of  motion  along  the  separatrix  can  be  solved  in  terms  of 
elementary  functions 

x(i,  0)  =  2  arctan  e*  —  (10) 

where  the  initial  conditions  are  specified  as  x(0, 0)  =  0  and  y(0,0)  =  7r/2.  Note  that  in  the 
deterministic  case  it  takes  an  infinite  time  to  reach  the  equilibrium  point  (7r/2,7r/2). 

It  is  crucial  for  the  following  analysis  that  we  can  interpret  (7)  as  a  Hamiltonian  system 
with  one  degree  of  freedom  by  identifying  (x,y)  — »  (p, q).  The  streamfunction  H{x,y)  — > 
H(p,q)  is  then  identified  with  the  Hamiltonian  of  the  system.  As  usually,  an  action  variable 
1(h)  can  be  introduced  as 


1(h)  =  ~J  K  (A  -  fr/2)  dti,  (11) 

which  is  equal  to  the  area  inclosed  inside  the  orbit  if(p,  q)  =  h  divided  by  27r,  see  [8]. 

2.1.2  Effective  Fokker-Planck  equation 

We  will  first  study  the  behaviour  of  solutions  to  (2)  with  H  given  by  (6)  with  B  =  1 
such  that  the  particle  is  staying  inside  one  cell.  The  typical  technique  for  analyzing  the 
evolution  of  slow  variables  of  such  a  system  subject  to  small  random  perturbations  is  the 
technique  of  averaging  out  the  fast  variables.  Then  an  effective  evolution  equation  for  slow 
variables  inside  a  cell  is  obtained  in  a  way  similar  to  the  analysis  of  small  perturbations 
in  classical  mechanics  [8].  In  the  framework  of  stochastic  differential  equations  this  ansatz 
was  introduced  by  Wentzell  and  Freidlin,  see  [12]  and  references  therein.  For  analogous 
consideration  in  the  framework  of  passive  scalar  equation  see  [17]. 

In  our  case  the  averaging  principle  can  be  briefly  summarized  as  follows:  Inside  the  cell 
the  deterministic  system  (7)  can  be  described  in  terms  of  motion  on  invariant  tori.  We 
parametrize  these  tori  by  the  values  of  H(x,y )  =  h .  In  the  perturbed  system  (at  least 
for  small  perturbations)  particles  will  still  rotate  rapidly  along  the  tori,  however  they  will 
slowly  drift  across  the  tori.  To  describe  this  slow  drift  we  calculate  dH(Xt)Yt).  Using  Ito’s 
formula  we  obtain 


dH(Xu  Yt )  =  y/2e  VH(Xt,  Yt)  -dWt  +  e  A  H(XuYt)dt,  (12) 

where  dWt  =  (dW^ .  dwjy^ )  is  the  two-dimensional  Brownian  motion.  Of  course,  we 
cannot  evaluate  the  terms  \7H(Xt,Yt)  *  dWt  and  A H(Xt)Yt)  explicitly  without  solving 
equations  (2).  However,  using  the  integral  form  of  (12) 

H(XuYt)  =  H(X0,Y0)  +  y/2e  f  VH(XS,YS)  ■  dWs  +  e  t  AH(Xs,Ys)ds,  (13) 

Jo  Jo 
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wc  see  that  because  of  the  smallness  of  the  perturbation  (i)  the  second  integral  is  approxi¬ 
mately  equal2  to  the  integral  fQ(AH)n=h(Xs,Y$)ds,  where 

(14) 

and  the  integrals  are  taken  over  the  level  set  {(x,  y)  :  H(x,y)  =  /?.}.  Moreover,  (ii)  the  first 
(stochastic)  integral  in  (13)  can  be  represented  as 

J\h(xs,ys)  -dws  =  w  Qf  |VH(x,,ys)|2dS) ,  (15) 

where  W(-)  is  a  one-dimensional  Wiener  process.  With  the  same  argument  as  before 
\VH(Xs,  Ys)\2ds  can  be  approximated  by  the  ergodic  average  of  |Vif|2 

<|Vff  fW  =  f\VH\dl  (16) 

The  obtained  effective  diffusion  process  is  most  conveniently  formulated  in  terms  of  an 
effective  Fokkcr-Planck  equation  with  time  rescaled  as  t  — ►  st 

dtp  =  d\  (A(h)p)  -  dh  ( B(h)p )  (17) 

Tlie  coefficients  A(h)  and  B(h)  given  by 

A(h)  =  (|V/7|2);y=/i,  B(h)  =  (A  H)„=h.  (18) 

In  the  case  of  H  given  by  (6)  with  B  =  1  these  coefficients  can  be  calculated  explicitly  using 
formulas  (9) 

A(h)  =  ^  -  2 h2,  B(h)  =  -2 h.  (19) 

7  K(VT^h?) 

Here  K(-)  and  E(-)  are  complete  elliptic  integrals  of  the  first  and  second  kind,  see  [7,  15]. 
Note  that  instead  of  the  Hamiltonian  we  could  have  used  the  action  variable  (11).  Indeed, 
the  formulation  in  terms  of  the  action  turns  out  to  be  very  convenient  for  generalization  of 
the  averaging  principle  to  higher  dimensions. 

2.2  Construction  of  an  approximating  Feller  process  (Wentzell-Freidlin 
technique) 

In  this  subsection  we  study  asymptotic  behaviour  of  solutions  of  (2)  constrained  to  a 
bounded  domain  (with  periodic  boundary  conditions)  approximating  them  by  a  process 
with  essentially  one-dimensional  state  space.  We  have  seen  that  (17)  specifies  completely 
the  behaviour  of  the  system  up  to  the  first  exit  time  out  of  a  cell.  Once  having  left  the 
cell  after  some  transitional  time  (during  which  it  will  stay  in  some  neighbourhood  of  the 
boundary  of  the  original  cell)  the  particle  will  again  slowly  diffuse,  either  in  the  original  cell 
or  in  another  neighbouring  cell. 

2This  is  duo  to  t.lio  averaging  principle. 
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Figure  1:  Reeb  graphs  of  the  flow  (6)  for  the  cases  JB  =  1,0<JB<1  and  B  =  0  on  the 
domain  of  width  2n  x  2n. 


To  describe  the  transition  phase  of  the  particle  from  one  cell  to  another  we  need  to  (i) 
give  a  geometrical  description  of  the  way  in  which  the  cells  are  connected  to  each  other,  (ii) 
analyze  in  detail  the  behaviour  of  the  diffusion  process  during  this  transition. 

The  geometrical  description  is  given  using  the  topological  notion  of  a  Reeb  graph  of 
H(x:y)  [9].  For  a  general  Hamiltonian  H(x,y)  it  is  defined  in  the  following  way:  Let 
(xc, yc)  be  a  critical  point,  i.e.  a  point  such  that  V H{xC)yc)  =  0.  Then  each  connected 
component  of  the  level  set  H~l(H{xc))  is  identified  with  a  vertex.  The  points  on  the  edges 
which  connect  the  vertices  are  identified  with  connected  components  of  the  noncritical  level 
sets  H~l(H(x,y)).  Figure  1  shows  the  Reeb  graphs  of  H(x,y)  given  by  (6)  on  the  domain 
[0,27r]  x  [0,27 r]  for  the  cases  J3  =  0,  0<B<1,B  =  1.  Note  that  in  the  case  B  =  1  the 
edges  of  the  Reeb  graph  correspond  to  the  interior  of  the  cells. 

The  geometrical  description  above  suggests  that  the  appropriate  state  space  of  the  pro¬ 
cess  approximating  the  solutions  of  (2)  is  the  Reeb  graph  T  of  the  Hamiltonian  H(x,  y)  with 
edges  denoted  by  e*  and  vertices  denoted  by  Ok •  In  side  each  edge  the  approximating 
process  is  governed  by  the  evolution  equation  of  the  type  (17) 


dtPi  =  dl  ( Ai(h)pi )  -  dh  ( Bi{h)pi )  (20) 

To  determine  the  process  completely  we  have  to  specify  the  boundary  conditions  at  the 
ends  of  each  edge,  “gluing”  the  edges  together  in  a  consistent  way.  This  “gluing”  procedure 
determines  the  behaviour  of  the  process  during  the  passage  through  the  vertices  of  the 
graph.  Physically  it  describes  transitions  of  a  particle  from  one  cell  to  another. 

Specifying  boundary  conditions  for  a  stochastic  process  is  in  general  a  quite  delicate 
point.  One  possibility  to  specify  the  boundary  conditions  is  to  require  the  approximating 
process  to  have  “nice”  mathematical  properties.3  In  [13]  all  possible  continuous  Markov 
processes  with  Feller  property4  on  graphs  were  described  such  that  the  diffusion  inside  an 
edge  is  governed  by  a  second  order  elliptic  operator  (which  can  possibly  depend  on  the 
edge).  In  our  case  the  ellpitic  operator  is  given  by  the  right  hand  side  of  (20).  Furthermore, 
at  each  edge  the  sum  of  the  incoming  probability  currents  has  to  vanish.  This  leads  to  the 

3 As  we  shall  see  later,  this  requirement  is  not  always  consistent  with  the  behaviour  of  solutions  of  (2). 

4 This  means  that  in  course  of  time  continuous  distributions  of  probability  remain  continuous. 
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following  boundary  conditions  at  a  vertex  O k 

^  ^  Ji{Ok)  ~  0,  Pi{Ofc)  =  Pj(Ok)i  i,j  :  ~  Ofc,  ej  ~  O^  (21) 

i :  e*~Ofc 

where  e^e*  ~  Ok  denote  the  edges  incident  to  the  vertex  O*. 

Thus,  the  approximating  technique  (in  the  following  referred  to  as  Wentzell-Freidlin 
technique)  consists  in  (i)  approximating  the  solutions  of  (2)  inside  of  the  cell  by  (17), 
(ii)  gluing  together  the  cells  by  matching  continuously  the  probability  distributions  at  the 
boundaries  of  the  cells. 

In  the  case  of  the  Hamiltonian  specified  by  (6)  on  bounded  domain  we  expect  that  the 
process  on  the  graph  approximates  well  solutions  of  (2).  Furthermore,  similarly  to  [12]  a 
particle  spends  a  zero  time  (on  the  time  scale  et)  at  the  vertex.  From  this  follows  that  once 
the  particle  reaches  the  cell  (edge)  boundary,  it  can  hop  instantaneously  to  any  other  cell. 
Furthermore,  the  behaviour  of  the  process  after  it  reaches  the  vertex  does  not  depend  on 
its  prehistory,  i.e.  on  the  edge  it  came  from.  Thus,  the  transition  probability  from  one  edge 
to  another  is  not  dependent  on  their  spatial  separation.  Note  that  for  e  — *  0  this  does  not 
result  in  any  contradiction,  because  of  the  very  long  relevant  time  0(1 /e)  oo. 

Altogether  the  Wentzell-Freidlin  technique  yields  the  following  picture  for  advection  of 
passive  scalar  in  a  bounded  domain:  Let  us  chose  an  initial  coondition  which  corresponds 
to  the  passive  scalar  being  concentrated  in  the  center  of  a  cell  at  t  =  0.  Then,  as  time  goes 
on,  the  passive  scalar  will  slowly  (i.e.  on  time  scale  0(1 /e))  diffuse  untill  it  reaches  the 
cell  boundary.  As  soon  as  it  reaches  the  boundary  it  will  very  quickly  (instantaneously  on 
time  scale  0(1/6:))  spread  across  the  domain  along  the  network  of  separatrices.  After  this 
the  passive  scalar  will  penetrate  the  cells  on  slow  time  scale  0(1 /e)  untill  the  stationary 
distribution  is  established. 

However,  the  above  implies  immediately  that  the  Wentzell-frcidlin  technique  cannot  be 
applied  in  the  unbounded  case.  Indeed,  it  would  state  that  a  transition  from  one  edge  to 
another  happens  instantaneously,  no  matter  how  large  the  spatial  distance.  It  can  also  be 
readily  seen  from  the  structure  of  the  Recb  graph  corresponding  to  the  unbounded  case. 
Indeed,  in  this  case  the  Reeb  graph  consists  of  one  vertex  with  infinitely  many  incoming 
vertices. 

2.3  Diffusion  in  cellular  flows:  unbounded  case 

As  we  have  seen  previously,  the  knowledge  of  the  behaviour  of  solutions  to  (2)  in  the  vicinity 
of  the  cell  boundaries  is  of  crucial  importance.  To  describe  this  behaviour  one  has  to  take 
into  account  events  such  as  a  particle  crossing  of  a  separatrice  and  going  to  another  cell. 
Qualitatively,  we  can  discribe  them  as  follows:  We  artificially  separate  the  cells  by  channels 
(which  play  the  role  of  the  boundary  layers  along  the  separatrices)  to  take  into  account  the 
transport  along  the  boundaries.  Let  5(e)  be  the  width  of  the  channel.  The  motion  of  a 
particle  consists  of  two  types  of  events:  (i)  slow  motion  across  streamlines  inside  the  cells 
with  typical  time  spent  inside  a  cell  tcc]\  =  0(1  fe)  and  mean  square  displacement  0(e°); 
(ii)  fast  transport  in  the  channels  with  velocity  of  order  0(e°)  and  time  spent  inside  the 
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(22) 


channel  0(67  /e).  The  effective  diffusion  is  given  by 

n  f  (-^2)channel  \  n  / x2\ 

**  )  0{s) 

Setting  5(e)  =  e *  we  obtain  Aceff  =  0{yfe).  However,  the  width  of  the  channel  is  something 
that  we  have  to  insert  by  hand  into  this  model.  Furthermore,  for  solution  of  the  cell 
problem  the  width  of  the  boundary  layer  is  usually  assumed  to  be  y/e.  Nevertheless,  this 
simple  model  is  usefull  because  it  gives  an  additional  intuition  about  the  nature  of  diffusion 
in  the  cellular  flow.  Indeed,  numerical  simulations  of  (2)  show  long  flights  along  the  cell 
boundaries  interupted  by  trapping  of  the  particle  inside  the  cells,  see  Fig.  2. 


Figure  2:  Random  motion  of  a  Brownian  particle  the  cellular  flow  B  =  1. 


During  the  excursions  the  stochastique  motion  can  be  described  by  a  very  simple  one¬ 
dimensional  model 

d,Ot  =  |  cos  ©*  |  +  V2~edWt ,  (23) 

where  8  €  [0, 47r] .  The  backward  Kolmogorov  equation  is  given  by 

dtp  +  |  cos  8\dep  =  edgp.  (24) 
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Solutions5  are  most  easily  found  by  writing  the  drift  |  cos  0|  as  the  derivative  of  the  following 
continuous  potential 


V(6)  = 


—4 k  -  sin  6  for  x  €  [—  |  +  2kn,  |  +  2kir]; 

—2(2 k  +  1)  +  sin  6  for  x  €  [—  §  +  (2k  +  l)7r,  |  +  (2k  +  l)7r], 


(25) 


such  that  |  cos  6 |  =  —doV (6).  Note  that  this  potential  contains  two  parts:  a  periodic 
part  Vq(6 )  with  period  7r  and  a  tilting  force  F  =  Obviously,  the  function  Vq(B)  — 
V(6 )  +  OF  satisfies  Vq(0  +  n)  =  Vq(0).  Stationary  distributions  are  easily  found  which 
allows  to  determine  the  mean  velocity  and  the  effective  diffusion 


<  v  >= 


t—*OC  f, 


D  =  lim 

t— *  oo 


E[Ot2]  -  (E[Ot])2 
2 1 


(26) 


It  turns  out  that  the  mean  velocity  does  not  vanish  and  the  effective  diffusion  is  proportional 
to  1/e,  see  [22].  The  behaviour  of  the  diffusion  process  in  the  neighbourhood  of  the  cell 
boundaries  has  been  studied  in  [14]  using  boundary  layer  asymptotics.  It  turns  out  that 
the  time  a  particle  spends  in  a  cell  is  of  order  0(s). 

Note  that  events  (i)  and  (ii)  have  two  different  time  scales:  (i)  is  on  time  scale  0(1 /yfe) 
while  (ii)  is  on  time  scale  0(1/ e).  A  similar  situation  has  been  discussed  in  [18]  for  effective 
diffusion  along  a  pipe  with  semiinfinite  pipes  branching  off  the  main  pipe.  The  fast  motion 
happens  inside  the  main  pipe  while  trapping  occurs  inside  the  side  branches. 

Since  the  stochastic  motion  of  a  particle  inside  of  the  unbounded  cellular  flow  is  a 
mixture  of  random  walk  on  the  lattice  of  cells  (jump  process)  and  slow  diffusion  inside 
the  cells  (continuous  process)  we  cannot  expect  the  Wentzell-Freidlin  technique  to  hold 
on  unbounded  domains.  Indeed,  as  Fig.  3  shows,  in  numerical  simulations  of  diffusion  of 
passive  tracer  on  the  cellular  flow  (6)  we  find  high  gradients  of  the  passive  scalar  accross 
the  cell  boundaries. 


3  Approximating  process  with  jumps 

In  this  section  we  propose  a  generalization  of  Wentzell-Freidlin  method  to  the  case  of  un¬ 
bounded  domains.  The  main  idea  is  to  give  up  the  mathematical  condition  of  continuity 
of  the  process  at  the  vertices  of  the  Reeb  graph  which  in  the  previous  paragraph  has  been 
shown  to  inconsistent  and  allow  for  processes  which  can  have  jumps  at  vertices.  The  conti¬ 
nuity  condition  has  to  be  replaced  by  another  condition  which  takes  into  account  processes 
happening  at  the  scparatrices  analyzing  them  more  carefully  than  it  has  been  done  in  Sec¬ 
tion  2.2. 

We  begin  by  outlining  the  procedure  which  can  be  used  for  obtaining  the  approximating 
process  on  the  domain  R2.  We  label  each  cell  by  a  two-dimensional  integer  vector  (ni,ri2)  6 
Z2.  The  effective  evolution  equation  inside  each  cell  is  given  by 

detP(nun2)  =  dh  (^(%(n,,n2))  “  dh  (5(%(n, ,n2))  (27) 

Consider  now,  analogously  to  [20]  a  water-pipe  network  QeN  =  {(x,y)  G  ft  :  \H(x,  y)\  < 
N y/e}  around  the  separatrices.  The  corresponding  water-pipe  approximation  is 

f  A 0%  -  V±H  *  -  0,  (:r,  y)  G  (28) 

5Stocliastic.  equations  with  periodic  drift  are  intensively  studied  in  [21]. 
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t  =  32 


Figure  3:  Diffusion  in  the  cellular  flow  given  by  (6)  with  B  =  1  for  e  =  0.0009765625  at 
t  —  32. 


however,  instead  of  zero  gradient  von  Neumann  boundary  conditions  at  the  level  set 


£(rn,n2){N y/e)  -  {{x,y)  €  fl :  \H(x,y)\  =  Ny/i,  (x,  y)  lies  in  the  cell  (nu  n2)}  (29) 


used  in  [20],  we  introduce  the  following  conditions 

=  (D9\n^n2),  (x,y)  e  C{nun2)(Ny/i).  (30) 

The  two  descriptions,  in  the  interior  of  the  cells  and  at  the  cell  boundary,  have  to  be  glued 
together.  It  is  here  that  the  conservation  of  probability  enters.  We  have  to  match  probablity 
current  leaving  the  edge  with  the  probablity  current  entering  the  water-pipe  network.  This 
gives  the  equation 


i  (D6)kdl  =  e  [B(h)p{nun2)(h]  -  dh  (A(h)p{nun2))] h= 
J  Ck(N  y/e) 


Ny/i  ' 


(31) 
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The  expression  on  the  right  hand  side  is  just  the  probability  current  entering  the  boundary 
of  the  cell. 

Now  we  turn  to  description  of  the  approximating  process.  First  of  all,  in  order  to  describe 
the  transport  from  one  cell  to  another  in  the  asymptotics  e  — >  0  we  find  it  convenient  to 
consider  the  network  of  lines  connecting  the  neighbouring  cells  which  is  in  fact  a  network 
dual  to  the  water-pipe  network.  In  our  case  of  H(x ,  y )  given  by  (6)  with  B  =  1  it  is  just  the 
two-dimensional  lattice  Z2.  At  each  vertex  of  this  network  we  specify  a  function  /(ni)«2)(i). 
This  function  serves  as  a  boundary  condition  for  the  effective  Fokker-Planck  equation  (27) 
(with  the  original  time)  in  the  cell  (711,712).  For  a  particle  the  probability  of  leaving  this 
cell  through  the  boundary  adjacent  to  one  of  the  neighbouring  cell,  e.g.  the  cell  (ni,  n<i  +  1) 
is  proportional  to  /(ni,n2-i-i)  “  /(ni,n2)*  Therefore  the  discontinuous  part  of  the  process  is 
governed  by  the  Laplace  lattice  operator  on  the  lattice  Z2 

^Z2/(ni  ,n2  )M  —  /(nj+l,n2)(0  +  /(ni-l,n2)(0  +  /(ni,U2+l)(0  +  /(  m  ,n2— ]  )W~4/(  ni,ri2  )(t)  (32) 
The  evolution  equation  for  /(m,n2)W  given  by 

P{n  i  ,n2  )(h)dh  =  A  Z2/( 

ni,n2)  (33) 

In  this  way  we  obtain  a  coupled  system  of  equations  which  yield  the  complete  description 
of  the  approximating  process.6 

4  Discusion 

The  generalization  of  the  Wentzell-Freidlin  technique  proposed  in  the  previous  paragraph 
is  easily  generalized  to  the  cases  of  nonperiodic  cellular  flows.  In  fact,  it  suffices  to  replace 
the  Laplace  operator  on  Z2  by  the  Laplace  operator  on  the  network  (graph)  dual  to  the 
network  of  separatrices  of  the  original  flow.  Of  course,  the  spectral  properties  of  the  graph 
Laplacian  then  depend  strongly  on  the  topological  structure  of  the  dual  network.  Therefore 
the  cellular  structure  of  the  flow  can  nontrivially  influence  the  solutions  of  (33). 

One  application  of  the  Wentzell-Freidlin  technique  is  connected  to  the  study  of  reaction- 
diffusion  equations.  However,  as  has  been  stated  in  [10],  in  its  usual  formulation  it  is  not 
applicable  without  any  restrictions.  The  generalized  form  of  this  technique  proposed  in  the 
report,  seems  to  be  suitable  to  a  wider  range  of  applications,  including  nonperiodic  cellular 
flows. 
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fjln  fact,  analogous  constructions  wore  discussed  in  [19,  18]. 


£  £  f 

dt  f(nun2)  +  dt  J 
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Scattering  past  a  cylinder  with  weak  circulation 

John  Rudge 
August  25,  2005 


1  Introduction 

Wave  phenomena  arise  in  a  wide  variety  of  geophysical  problems.  Indeed,  in  this  year’s 
principal  lectures  a  main  focus  was  the  modelling  of  waves  in  the  ocean.  It  was  in  this 
context  that  ray  tracing  and  the  geometrical  theory  of  diffraction  were  introduced. 

An  important  distinguishing  feature  of  waves  in  the  atmosphere  and  the  ocean  is  that 
they  propagate  through  a  fluid,  and  that  fluid  is  often  already  in  motion.  Familiar  examples 
include  the  propagation  of  acoustic  waves  in  the  atmosphere  in  the  presence  of  winds,  or 
gravity  waves  in  the  ocean  in  the  presence  of  currents.  Ray  tracing  has  routinely  been 
used  to  solve  such  problems,  and  there  is  a  large  amount  of  current  research  devoted  to 
understanding  these  wave-mean  interactions. 

Diffraction  is  the  apparent  bending  and  spreading  of  waves  when  they  meet  an  obstruc¬ 
tion.  It  is  a  phenomenon  not  described  by  ordinary  geometric  optics.  However,  an  extension 
to  ray  tracing  called  the  geometrical  theory  of  diffraction  (GTD)  can  overcome  this  prob¬ 
lem.  On  the  whole  GTD  has  been  little  applied  to  wave-mean  problems,  and  the  focus  of 
this  project  is  to  understand  how  GTD  can  be  used  in  the  presence  of  a  mean  flow. 

We  consider  a  new  twist  on  the  canonical  problem  of  scattering  of  a  plane  wave  past  a 
circular  cylinder.  Scattering  past  a  cylinder  is  a  classical  problem  with  a  long  history.  A 
good  introduction  to  the  ideas  behind  this  work  can  be  found  in  [5]  and  in  particular  the 
application  of  GTD  to  the  circular  cylinder  can  be  found  in  [7].  Special  functions  abound 
in  scattering  problems,  and  [1]  is  an  invaluable  source  for  looking  up  their  properties. 

Our  problem  considers  the  addition  of  a  weak  circulation  around  the  cylinder,  which 
could  be  motivated  by  the  problem  of  modelling  weak  currents  around  an  island.  We 
emphasise  here  that  the  circulation  is  weak  as  this  simplifies  matters  considerably  [3] . 

The  geometry  is  shown  in  Figure  1.  We  let  the  radius  of  the  circular  cylinder  be  a  and 
take  coordinates  centred  on  the  cylinder.  We  take  the  plane  wave  to  be  incoming  from  +oo 
on  the  rr-axis.  The  cylinder  is  taken  to  be  impermeable. 

2  The  governing  equations 

Following  [2],  we  set  up  our  governing  equations  as  those  of  2-D  compressible  gas  dynamics, 
which  have  as  a  special  case  the  familiar  shallow  water  equations.  The  continuity  equation 
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Figure  1:  The  geometry  of  the  problem.  An  incoming  plane  wave  is  incident  on  a  cylinder 
with  circulation. 


is 


D  h 
D/ 


+  /iV  ■  u  =  0, 


and  the  momentum  equation  is 


^  +  _4_v  (*•>-■)  =  o. 

D  t  7  —  1  v  ' 


a) 

(2) 


Here  u  is  the  two  dimensional  velocity  vector  of  the  fluid,  and  h  is  the  density  of  the 
gas,  or  the  height  of  the  free  surface  in  the  case  of  shallow  water.  For  gas  dynamics  7 
is  the  polytropic  exponent  and  c  —  cq \/H~~]  is  the  undisturbed  sound  speed  for  a  gas 
of  uniform  density  H .  For  shallow  water  7  =  2,  Cq  =  g  the  acceleration  due  to  gravity, 
and  the  undisturbed  gravity  wave  speed  for  a  layer  of  uniform  depth  H  is  c  =  \/gH .  The 
corresponding  equation  of  state  is 


7 


(3) 


where  the  additive  constant  has  been  neglected.  The  momentum  equation  can  then  be 
written  in  momentum  flux  form  as 


d  (hu) 
dt 


+  V  •  (/;uu)  +  Vp  =  0. 


(4) 


We  will  assume  that  our  flow  is  irrotational,  V  x  u  =  0.  This  implies  we  can  write  u  in 
terms  of  a  velocity  potential,  u  =  V^>.  The  momentum  equation  can  then  be  integrated  to 
give  Bernoulli’s  equation 


+  =  G{t ), 

7  -  1 


(5) 


where  G(t)  is  an  arbitrary  function  of  time  alone.  Note  that  Bernoulli’s  equation  determines 
/?.  as  a  function  of  the  velocity  potential  <jf>. 
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3  Small  amplitude  waves 

3.1  Time  averaged  equations 

When  studying  wave  phenomena  it  is  often  useful  to  decompose  fields  into  a  time  averaged 
mean  part  and  a  disturbance  part,  namely  </)  =  </>  +  <f>\  where  (<f>/)  =  0.  Averaging  (1),  the 
averaged  continuity  equation  is 


V  ■  (hu  +  h'u')  =  0, 

(6) 

and  averaging  (5)  the  averaged  Bernoulli  equation  is 

o  hrr  1  1  /.  ,o  .  „2\ 

co - ^  =  constant  —  -  (Ju|  +  |u'|  j  . 

(7) 

We  perform  a  standard  perturbation  analysis  of  the  governing  equations  in  terms  of  a 
small  non-dimensional  wave  amplitude  parameter  rj.  We  will  assume  the  0(1)  flow  has  no 
disturbance  part,  and  that  the  0(rj)  flow  has  no  mean  part.  For  the  mean  flow  we  write 

4>  =$  +  0  +  rffo  +  ... , 

(8) 

h  ~H  +  0  +  h.2  +  ••• , 

(9) 

u  =U  +  0  +  ?72U2  +  ...  • 

(10) 

where  capital  letters  are  used  to  denote  the  0(1)  flow.  For  the  disturbance  we  write 


4>  —  0  +  ?7<^i  +  f]2(j) 2  +  ... , 

(11) 

h!  =  0  +  t] h\  +  T}2 h'^  +  ... , 

(12) 

U'  =  0  +  liu\  +  772U2  +  ... . 

(13) 

3.2  Mean  flow 

The  mean  flow  that  we  are  applying  is  that  of  a  line  vortex: 


(14) 

(15) 

(16) 


Here  e  is  a  non-dimensional  parameter  determining  the  strength  of  the  vortex.  Throughout 
this  work  will  neglect  terms  O  (e2).  Hence  (16)  becomes  simply  H  =  constant.  We  have 
non-dimensionalised  on  c  the  constant  undisturbed  wave  speed,  and  a  the  radius  of  the 
cylinder.  Note  that  the  maximum  mean  flow  occurs  on  the  cylinder  where  |U|  =  ec,  so  the 
non-dimensionalisation  is  such  that  e  is  a  ratio  of  mean  flow  speed  to  wave  speed  (a  Froude/ 
Mach  number).  The  circulation  F  associated  with  the  line  vortex  is  F  =  27reca.  Note  that 
the  chosen  mean  flow  is  incompressible,  V  *  U  =  0. 
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3.3  Linear  waves 

The  0{rj)  continuity  equation  is  found  from  (1)  to  be 


+  U  •  Vh\  +  HV %  =  0,  (17) 

where  we  have  used  incompressibility  of  the  mean  flow,  and  that  H  is  constant.  The  0(rj) 
Bernoulli  equation  is  found  from  (5)  to  be 


'■'  =  1  (-f  -u-V). 


Combining  these  equations  we  find 


?V2,'  Wi 

c  V  -  ~KPT 


dV<f)\ 


U  •  V  (U  •  V^i)  =  0. 


Neglecting  0(e2)  terms  this  becomes 


„2n2  jJ 


czVz^ 


—  2U  ■ 


or  in  polar  coordinates 


C2V2^/  _  dVi  2tcad2ft 
c  V  0^2  Z  ^.2 


In  the  case  of  no  mean  flow,  e  =  0  and  this  reduces  to  the  familiar  wave  equation. 
We  can  define  the  local  energy  density  E  by 

p  <?h?  «|u',|2 

Using  equations  (17)  and  (18)  the  energy  equation  can  be  derived: 


dE  ,  9  ,  ..  ,  /VU  +  VUT\  . 

+  v .  (ev  +  c%u[)  =  -hu\  •  — i —  •  u;. 


3.4  Terms  of  second  order  in  wave  amplitude 

On  the  whole  we  shall  not  be  concerned  with  second  order  terms,  but  they  are  important 
when  calculating  the  force  on  the  cylinder.  The  quantity  of  interest  is  the  time  averaged 
pressure 


2  2 

Cn—  Cn 


p  =  ~h->  =  ^Hy  +  r?  (  -clH^hf  +  4Hy~1h2  )  +  0(if). 

7  7  \  2  J 

The  time  averaged  Bernoulli  equation  (7)  implies  that  to  second  order  we  have 


-  -  -CqH1  3hf  +  CqH 1  2/)2  =  constant  —  U  •  112  —  huil2- 
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(26) 


Hence  we  can  rewrite  the  averaged  pressure  as 


H : 


p  =  constant  +  r?2  (  —/if  -  —  \u[\  -  HU  •  u2  +  0(?73). 


2  H 

Note  that  without  a  mean  flow  (26)  gives  the  0(r?2)  pressure  purely  in  terms  of  first  order 
quantities.  With  a  mean  flow,  U2  (a  second  order  term)  must  also  be  specified  to  calculate 
the  pressure.  To  0(e)  we  have  from  (18)  that 


h'  -  c1 


((*)'•■*»■«)■ 


(27) 


Hence  the  time  averaged  pressure  can  be  written  in  terms  of  the  velocity  potentials  as 
/ 

i 


P  =  constant  +  _  i  |V^  |2  +  -  U  •  V<^2  1  +  0(r?3).  (28) 


/  \  2 


1- 


1  Wl: 


The  second  order  term  we  are  interested  in  is  02,  and  since  its  term  in  the  above 
expression  is  multiplied  by  U  we  may  neglect  0(e)  terms  in  its  solution.  From  the  time 
averaged  continuity  equation  (6)  we  find  neglecting  0(e)  terms 


H\7  •  U2  =  -V  •  (^ufl  . 


(29) 


Now  by  time  averaging  the  energy  equation  (23)  we  find  that  V  •  (/i^Uj)  =  0(e).  Hence, 
the  leading  order  governing  equation  for  02  is  simply  Laplace’s  equation  V20 2  =  0. 


4  Eigenfunction  solution 

We  will  seek  time  harmonic  solutions  to  (20)  of  the  form  0'1(x,£)  =  <0(x)e”^t,  where  uj  is 
a  chosen  constant  angular  frequency,  and  the  real  part  is  assumed.  (20)  then  reduces  to 


C2V20  +  +  2iuoU  •  V0  =  0. 


(30) 


Let  /c 00  =  u;/c,  the  constant  wavenumber  at  infinity  where  the  mean  flow  is  absent.  Then 
this  can  be  written  as 


U 


V  0  +  +  2ikoo—  •  V0  =  0, 


or  in  polar  coordinates  as 


V20  +  + 


2iek00a  50 
r2  86 


0. 


(31) 


(32) 


which  in  the  case  of  no  mean  flow  is  the  familiar  Helmholtz  equation. 

(32)  can  be  solved  by  separation  of  variables.  Let  0(x)  =  R{r)Q{6 ).  Then 

r2R"  +  rR'  +  (k^r2  -  A 2)R  =  0, 

0"  +  2iek0OaQ'  +  A2©  =  0. 
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(33) 

(34) 


where  A  is  a  constant.  These  have  solutions  of  the  form 


(35) 

(36) 


R(r)  =^1’2)(fe00r), 

0(0)  —  ei(±A-efcooa)0. 

where  Hi  ’  ;(^)  are  Hankel  functions  of  the  first  and  second  kinds  of  order  v.  Since  we  must 
have  a  single  valued  function  of  0,  we  have  that  A  =  ±(m  +  e/c^a),  where  m  €  Z.  Also, 
since  //i^(z)  =  the  eigenfunctions  of  (32)  are  thus  just  H^2\k00r)eime , 

where  m  =  m  +  ek^a.  As  we  are  solving  a  self  adjoint  problem  these  eigenfunctions  are 
orthogonal  and  we  can  express  the  general  solution  in  terms  of  these  eigenfunctions  as 

^(r,  0)=J2  (Am Hli] (*W r)  +  (fcooO)  eim 6  (37) 

777  ez 

for  constants  Am,  Bm  to  be  determined. 


4.1  Green’s  function  for  a  point  source 

Consider  a  point  source  at  x0  =  (r0,  do)  in  polar  coordinates.  The  governing  equation  for 
the  Green’s  function  G(x,xo)  is 

V2G  +  klG  +  ^  =  <5(x  -  x°)-  (38) 

Using  the  eigenfunction  expansion  (37)  it  can  be  shown  that 


G(x  )  =  1  y  ggWm in)^(fcooa)  -  /^)(^oor,nin)^2)(fcooa)  „(l) 
8i^z  H&'Hkooa) 


12  m  l  thoo '  max  )  e  j 


(39) 

where  rmax  =  max(r, 7*o),  r,n -m  =  min(r, ro).  An  alternative  expression  for  G(x,xo),  easier 
to  compute  numerically,  is 


G(x.x0)=1-J2~ 

777  CZ 

(40) 

where  Ju(z)  and  Yv(z)  are  Bessel  functions  of  first  and  second  order  respectively. 

Not.e  that  since  the  problem  is  self-adjoint,  the  Green’s  function  satisfies  a  reciprocity 
relation  G(x,xo)  =  G*(xo,x),  where  *  denotes  complex  conjugation.  Since  G*(x,xq)  satis- 
fies  _  * 

V2G*  +  klG*  -  =  <5(x  -  xo),  (41) 

then  the  reciprocity  relation  can  be  simply  stated  as:  the  field  at  x  due  to  a  point  source  at 
xo  is  the  same  as  the  field  at  xo  due  to  a  point  source  at  x  with  the  direction  of  the  vortex 
reversed. 


( ^oo Unin  )  1  fn  ( ^oc^O  Yfn  ( kc 


H'^ikooa) 


3^  inin  )J7n(koca)  u{\) 

H  h  (kc 


im(0—0o) 
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4.2  Eigenfunction  solution  for  an  incoming  plane  wave 

We  want  to  find  the  field  due  to  scattering  of  an  incoming  plane  wave  on  the  cylinder.  The 
potential  if>i  for  a  plane  wave  incident  from  +oo  on  the  rr-axis  is 


ipi  =  e 


—ikac^  cos  0—iekooa6 


Note  that  the  above  expression  satisfies  (32)  neglecting  terms  of  0(e2).  Note  also  that  this 
expression  has  a  branch,  and  so  6  has  to  be  defined  so  that  — 7r  <  6  <  7r.  Unless  ek^a  is  an 
integer,  'ipi  will  be  discontinuous.  ^  can  be  expanded  in  terms  of  Bessel  functions  as 


Jm(kr)e-ilh^2eime 


(43) 

(44) 


To  solve  the  problem  of  scattering  on  the  cylinder  by  the  incident  wave  we  propose  a  solution 
of  the  form  -0  =  ^  where  'ipg  is  an  outgoing  scattered  wave  of  the  form 

{k<x>r)eim6 .  (45) 

77i  ez 


Applying  the  boundary  condition  ^  —  0  on  r  =  a  yields 


__  1  \  s  (^ooa)  +  (&oo&)  rr(l)/i  \  -im7r/2  im6 

MU,,.  T  r)e  e  , 


r'(2) 


A 


ttiGZ 


n£Hk ooo) 


=  1  ^  g^(fc00r)^J(fc0oa)  -  J^^fcoorJ^Cfeooo)^, 


7776  Z 


g  irhTz/2^imO 


(46) 

(47) 


The  expression  for  ^  can  be  rewritten  as 


m£Z 


Jfn  (  fcpo r  )  (  fcpo a )  V ?n  (  fc oo  r )  ( ^‘00  a )  -777777 /2  imO 


HI 


Wri. 


{k0 0d) 


(48) 


which  is  easier  to  compute  numerically. 

The  eigenfunction  solutions  are  useful  for  plotting  for  moderate  k^a.  However  for  large 
/cooa  a  large  number  of  modes  m  must  be  taken  to  provide  an  accurate  approximation.  Ray 
tracing  overcomes  this  restriction  by  providing  an  asymptotic  theory  for  large  k^a. 


5  Ray  Tracing 

Ray  tracing  can  be  used  to  provide  an  asymptotic  solution  to  (21)  for  a  slowly  varying 
wavetrain  embedded  in  a  slowly  varying  background  environment.  Let 

0;  ~z(x)eie(x’t),  (49) 
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where  we  will  suppose  the  phase  0  is  rapidly  varying,  and  the  wave  amplitude  2  slowly 
varying.  The  local  wavenumber  k  and  local  frequency  uj  are  defined  by 

a© 

k  =  ve,  =  -  — .  (50) 

The  standard  ray  tracing  equations  are  then  given  in  terms  of  the  dispersion  relation 


uj  ~  Sl(x,  k)  =  ck  +  U  *  k, 

where  k  =  |k|,  as  Hamilton’s  equations 

dx  dfl  dk  dQ 
dt=+dk’  d 

The  ray  tracing  equations  imply  that  duj/dt  =  0,  i.e.  that  absolute  frequency  is  conserved 
along  a  ray,  and  we  will  consider  uj  a  global  constant  along  all  rays.  The  group  velocity  cg 
is  given  by 

dx  ^ 

Cg  =  di =  ck + U)  (53) 

where  k  =  k /k.  Another  important  consequence  of  the  ray  tracing  approximation  is  the 
conservation  of  wave  action.  Define  the  intrinsic  frequency  by  uj  =  c/c,  the  frequency  of  the 
wave  in  a  frame  moving  with  the  fluid.  Then  the  wave  action  A  =  E/uj ,  where  E  is  energy 
density  defined  in  (22),  satisfies 


(51) 

(52) 


—  +  V  •  (Acg)  =  0.  (54) 

Note  also  that  the  energy  density  satisfies  equipartition  in  the  ray  tracing  approximation, 
namely 

c2E?  _  H\uff 
~  2 


5,1  Consequences  of  the  weak  mean  flow 


For  an  irrotational  mean  flow  there  is  a  curious  result  which  states  that  to  order  e  the  ray 
paths  are  straight  [3,  6].  However,  there  is  still  refraction  of  the  wave  due  to  the  0(e) 
variation  in  k  given  by 


k  —  p  _  £  ^  —  k  _  6^°°ac 

^  —  ^OO  A'00  —  ^OC  1 

c  r 

where  k^  is  the  wavenumber  vector  at  infinity  for  the  ray  in  question,  and  A)oo  = 
(Figure  2). 

The  phase  progression  along  the  ray  is  given  by 


(56) 

|koo  | 


0  =  J  k  •  dx  =  J  (koo  —  efcoodW)  •  dx 

=  constant  +  (k^  •  x  -  ek^aO) 


(57) 

(58) 


Since  k^  is  in  the  direction  of  the  ray,  this  can  be  written  as 


0  —  0q  +  koc  (s  —  sq  —  ea(6  —  6q)) 


(59) 
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Figure  2:  A  cartoon  of  refraction  of  the  incident  wave.  Two  incident  rays  are  shown,  one 
above  the  cylinder  and  one  below.  The  arrows  along  the  rays  indicate  the  direction  and 
magnitude  of  the  wavenumber  vector  at  various  points  along  the  ray.  The  rays  themselves 
are  straight,  but  there  is  refraction  from  the  changing  wavenumber  vector  given  by  (56).  Far 
away  from  the  cylinder  the  wavenumber  vector  aligns  with  the  ray  direction.  Note  that  the 
wavenumber  becomes  larger  as  the  ray  passes  the  cylinder  for  the  bottom  ray,  but  smaller 
for  the  top  ray. 


where  s  —  So  is  the  distance  travelled  along  the  ray,  and  0  —  do  is  the  angular  change  along 
the  ray. 

The  wave  action  is  given  by 


E  _  Hk z2 
U)  c 


(60) 


(54)  implies  that  V  •  (Acg)  =  0,  which  to  0(e)  implies  simply 


V  •  (^koo)  =  0. 


(61) 


Consider  an  infinitesimal  ray  tube  R  with  ends  E\ ,  E<i  orthogonal  to  the  ray.  Note  that  koo 
is  parallel  to  the  sides  of  the  ray  tube  and  orthogonal  to  its  ends.  Then  by  applying  the 
divergence  theorem  to  (61) 

0  =  f  V  •  (z2 koo)  dV=  f  z2 koo  •  n  dS  -  f  z2 k^  •  n  d S.  (62) 

Jr  J  E2  J  E\ 

Since  •  n  is  a  constant  this  leads  to  the  simple  result  that  z^dS  is  constant  along  a  ray 
tube. 

For  a  plane  wave  incident  from  x  =  oo  it  follows  that  the  incident  wave  field  is  <pi  = 
e-%koo(x-\-eae) ,  w]iere  we  }iave  prescribed  that  the  incident  wave  has  unit  amplitude. 
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Figure  3:  Close  up  of  reflection  at  the  cylinder,  where  it  can  be  considered  locally  as  a  flat 
wall.  Note  that  the  wavenumber  vectors  (the  small  arrows)  are  not  in  the  same  direction 
as  the  rays  as  they  hit  the  wall. 

5.2  Reflected  Wave 

We  now  consider  reflection  of  a  ray  on  the  cylinder.  Locally  we  can  consider  the  cylinder 
as  a  flat  wall  (Figure  3).  Define  new  coordinates  with  x  perpendicular  to  the  wall  and  y 
parallel.  Let  the  mean  flow  along  the  wall  be  U  —  (0,  ec).  Let  the  incident  ray  hit  the 

wall  at  (0,  0)  with  angle  of  incidence  a  and  the  reflected  ray  leave  with  angle  of  reflection 

/3.  Far  from  the  wall  the  wavenumber  vector  of  each  ray  is  in  the  same  direction  as  the 
ray.  Moreover,  since  u)/c  is  constant  everywhere,  both  incident  and  reflected  wavenumber 
vectors  must  have  same  magnitude  far  from  the  wall.  Hence  we  may  write 

k^,  =  koo  ( -  cos  O',  sin  o) ,  (63) 

k£o  =  fcoo  («>s  (3,  sin  (3) ,  (64) 

for  the  incident  and  reflected  wavenumbers  at  infinity  respectively.  Prom  (56)  we  see  that 
the  incident  and  reflected  wavenumbers  at  the  wall  are  given  by 

ko  =  k oo  (—  cos  a,  sin  a  —  e) ,  (65) 

ko  =  koc  (cos  f3,  sin  (3  -  e) .  (66) 

Hence  locally  we  have  that 

</;  =  +  V'r  =  2,c,kox+eo  +  zTeik  ox+©5.  .  (67) 
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(68) 

(69) 


Using  the  boundary  condition  =  0  at  x  =  0  we  find 

— Zi  cos  a  +  zr  cos  (3  =  0, 

— ©o  +  ©o  =  0. 

From  this  it  follows  that  —  zr  and  a  =  (3.  Hence  the  angle  of  incidence  is  equal  to  the 
angle  of  reflection,  and  the  reflected  wave  has  the  same  amplitude  and  phase  as  the  incident 
wave  as  it  leaves  the  wall. 


Figure  4:  Reflection  on  the  cylinder  with  an  incident  ray  of  angle  of  incidence  (3/ 2. 


We  now  return  to  the  global  view  (Figure  4).  Consider  a  ray  hitting  the  cylinder  at  an 
angle  of  incidence  [3/2.  Then  it  hits  the  cylinder  at  (x,y)  =  a(cos/3/2, sin/3/2).  At  that 
point  the  incident  ray  has  phase 

©o  -  — fcoo(acos  (3/2  +  ea(3/ 2).  (70) 

The  phase  progression  along  the  reflected  ray  is  given  from  (59)  as 

©r  =  ©5  +  koo  (s-so-  ea(6  -  (3/2)) ,  (71) 


where  s  is  the  distance  along  the  ray  from  the  focus,  and  so  is  the  distance  from  the  focus 
to  the  point  at  which  the  incident  ray  hits.  The  focus  is  the  point  inside  the  cylinder  from 
which  rays  locally  spread  out  from.  Geometrically  so  is  found  to  be  $o  =  a/2cos/3/2.  Hence 
combining  (69),  (70),  and  (71)  we  find  the  phase  progression  along  the  reflected  ray  as 

©r  =  &oo  f  s  —  cos  (3/2  —  eao  \  .  (72) 


Rays  spread  out  radially  from  the  focus.  z2dS  =  constant  along  a  ray  tube,  and  the 
incident  wave  has  unit  amplitude.  Hence  we  have  that  the  amplitude  of  the  reflected  ray  is 
given  by 


zr  — 


a  cos  P/2 


2s 


(73) 
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Hence  the  reflected  field  takes  the  form 


Ipr  = 


a  cos  {3/2 
2s 


gikoo  (s—3a/2  cos  /3/2 —eaO) 


(74) 


5.3  Diffracted  Wave 

To  calculated  the  diffracted  field  we  first  go  back  to  the  problem  of  a  point  source  rather 
than  an  incoming  plane  wave,  as  we  will  find  diffraction  coefficients  by  comparison  with  the 
Green’s  function  of  a  point  source.  We  apply  the  geometrical  theory  of  diffraction  (GTD) 
to  the  problem  (Figure  5). 


Figure  5:  Cartoon  of  the  geometrical  theory  of  diffraction.  The  grazing  ray  hits  normal 
to  the  cylinder  and  produces  a  surface  ray.  This  surface  ray  travels  around  the  cylinder 
constantly  shedding  diffracted  rays  normal  to  the  cylinder. 


5.3.1  Incident  rays 

Consider  a  point  source  located  at  the  point  (ro,0)  in  Cartesian  coordinates  (Figure  6). 
Consider  the  two  rays  which  leave  this  point  and  hit  the  cylinder  at  right  angles.  Let  a  be 
the  angle  between  the  point  at  which  the  rays  hit  the  cylinder  and  the  horizontal.  Then  the 
wavenumber  at  infinity  for  this  ray  is  given  by  koo  =  (—  sin  a,  ±  cos  a)  where  +  is  the  top 
ra.y  and  —  is  the  bottom  ray.  At  the  points  at  which  the  rays  hit,  U  =  ±ec(—  sin  a,  ±  cos  Of), 
so  that  the  top  ray  hits  going  with  the  flow,  and  the  bottom  ray  hits  going  against  the  flow. 
The  wavenumber  vector  at  the  points  the  rays  hit  are  then  given  by  (56)  as 

k  =  fcoo(l  T  c)(—  sin  a,  ±  cos  a),  (75) 

or  in  terms  of  the  unit  vector  e$  as 

ktop  =  A'oo(l  -  e)ey.  kbot  =  -k^  1  +  ()ee.  (76) 
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Q(r0,0) 


Figure  6:  Geometrical  theory  of  diffraction  construction  for  a  point  source.  There  is  a 
point  source  at  Q  and  we  are  observing  the  field  at  a  point  P  in  the  shadow  region.  Two 
rays  paths  are  shown,  one  which  involves  an  anticlockwise  surface  ray  from  Q\  to  P\,  and 
another  which  involves  a  clockwise  surface  ray  from  Q2  to  iV 


Hence  as  these  incident  rays  hit  the  cylinder  their  wavenumber  vectors  are  tangent  to  the 
cylinder.  The  phase  progression  along  these  two  rays  is  given  by  (59)  as 


0(Qi)  =  k 


■00  1  A/  r0  “  a  eaa 


©(<32)  =  koo  (\fr%  -  a2  +  eaa)  • 


The  field  due  to  a  point  source  in  free  space  with  no  mean  flow  has  the  form 

Ait/AAk^r' 

v  4  0  v  00  1  y/8^k^P 


(77) 

(78) 

(79) 


where  r!  is  the  distance  from  the  point  source.  Hence  the  amplitude  of  the  rays  as  they  hit 
the  cylinder  is  given  by 

e^/4 


zl(Qi)  = 


y&jrkooy/rfi  -  a2 


(80) 


5.3.2  Surface  rays 

In  the  geometrical  theory  of  diffraction,  a  ray  hitting  the  cylinder  at  right  angles  causes 
the  production  of  a  surface  ray.  This  ray  is  contrained  to  go  around  the  cylinder,  and 
sheds  diffracted  rays  tangent  to  the  cylinder  as  it  progresses  around.  On  the  surface  ray 
k  =  ±ke$.  The  surface  dispersion  relation  is  then 

oj  =  f2s(x,  k)  =  ck(  1  ±  e)  (81) 

which  is  a  constant  independent  of  position.  Hence  the  magnitude  of  the  wavenumber  vector 
is  a  constant  along  the  surface  ray,  and  depends  only  on  the  direction  of  travel.  For  a  surface 
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ray  travelling  with  the  flow  k  =  fcoo(l  -  c)e0,  and  against  the  flow  k  =  —  fcoo(l  +  €)e#.  Thus 
the  corresponding  phase  progression  along  the  with  flow  surface  ray  is 

Q(Pi)  =  0(Qi)  +  koo(l  -  efau  (82) 

where  71  is  the  angle  travelled  around  the  cylinder.  Similarly  for  the  against  flow  surface 
ray 

0(P2)  =  0(Q2)  +  fcoo(l  +  e)a72,  (83) 

where  72  is  measured  in  the  opposite  direction. 

Using  the  GTD  we  assume  the  amplitude  of  the  surface  ray  is  proportional  to  the 
amplitude  of  the  incident  ray  that  created  it.  Namely,  that 

ZS{Q\)  =  di(QiK(Qi),  (84) 

where  di(Qi)  is  a  diffraction  coefficient  depending  only  on  the  curvature  of  the  surface.  In 
the  GTD  it  is  proposed  that  the  rate  of  decay  of  wave  action  A  travelling  along  the  ray  is 
proportional  to  the  wave  action.  Namely,  that 

A  A 

—  =  —2a  A,  (85) 

da 

where  a  is  arc  length  along  the  ray  and  a  is  a  constant  depending  solely  on  the  curvature 
of  the  surface.  The  wave  action  of  the  surface  ray  is  proportional  to  the  square  of  the 
amplitude  and  hence  the  surface  ray  decays  exponentially  in  amplitude  as 

zs(P1)  =  e-°a^(Q1).  (86) 


5.3.3  Diffracted  rays 


As  the  surface  ray  travels  around  the  cylinder  it  sheds  diffracted  rays.  A  diffracted  ray 
leaves  tangent  to  the  cylinder,  so  the  phase  progression  along  the  diffracted  rays  is  given  by 
a  similar  equation  to  the  incident  rays,  namely 

©(F)  =  0(F,)  +  koo  (v/r2  -  a2  -  ea/?)  ,  (87) 

0(F)  =  0(Fi)  +  A-oo  (\/r2-a2  +  ea/?)  ,  (88) 

where  (3  is  the  appropriate  angular  progression  after  leaving  the  cylinder.  We  assume  the 
amplitude  of  the  diffracted  ray  is  proportional  to  the  amplitude  of  the  surface  ray  which 
shed  it.  As  diffracted  rays  leave  the  cylinder  they  spread  out,  and  so  the  amplitude  is 
inversely  proportional  to  the  square  root  of  the  distance  from  the  cylinder.  Hence 


\p)  = 


d2(Pi) 


(kx  Vr2  -  a2)1/2 


(89) 


where  d2(Pi)  is  a  further  diffraction  coefficient  dependent  only  on  curvature,  and  fcoo  is 
an  appropriate  non-dimensionalisation  factor.  By  the  reciprocity  relation  of  the  Green’s 
function  we  must  have  that  d\{Q\)  =  d2(Pi)  —  d ,  a  constant.  However  there  is  still  a 
possibility  that  the  diffraction  coefficient  d,  and  decay  coefficient  a  may  depend  on  the 
direction  of  travel  around  the  cylinder,  whether  going  with  or  against  flow.  However,  when 
considering  an  asymptotic  evaluation  of  the  Green’s  function  later  it  will  turn  out  that  they 
do  not. 
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5.3.4  The  diffracted  field 


Diffracted  rays  are  simply  ordinary  geometric  optics  rays.  Combining  the  phase  progression 
equations  we  find  that  for  the  top  travelling  rays  the  phase  at  P  is 


0(P)  =  koo  rl  -  a2  +  ayi  +  V r2  -  a2  -  eao'j  , 
and  for  the  bottom  travelling  rays  the  phase  at  P  is 


(90) 


where 


r  o  r 

Combining  the  amplitude  equations  we  find 

eiir/4e-aald2 


a2  +  aj2  +  'Jr1  —  a2  +  ea  (27 r  —  0) J  , 

(91) 

1  a  a 

■  COS  - - cos  — , 

(92) 

r0  r 

a  -1  a  -ia 

—  9  —  cos - cos  - . 

(93) 

Z(P) 


(87 —  o?\/r2  —  a2)1/2 


(94) 


Hence 

UP) 


J7r/4d2 


_  i/coo  (ifcoo-ckj)a  (cos"1  ^+cos-1  ^ 

(Sirk^Q  y/r^  —  a2Vr2  —  a2) */2 
x  ^e(ifcoo(l-e)“0)a^  e(ifeoo(l+c)“Qi)a(27r-0)^  ^ 


(95) 


However,  note  also  that  there  are  also  further  rays  due  to  multiple  orbits  of  the  cylinder  by 
the  surface  ray.  These  rays  just  give  additional  factors  of  2nm  added  to  9  and  2ix  —  6  where 
m  E  N.  These  extra  terms  are  easily  summed  as  they  form  geometric  series.  Furthermore, 
when  the  ray  hits  it  excites  numerous  surface  rays  with  different  ctj  and  dj .  This  leads  to 
the  final  expression 


mp)=J2 


T/4d? 


(871-/0^  y/r^Pa^^/r^PPa?)1/2 


ifcoo(v/ro_a2+'/,r2_a2)_(i,Coo_ai)a(cos  1  ^+cos  1  7) 


a(ifcoo(l-e)— otj)a6 


3(i/coo(l+e)-aj)a(27r-0) 


X  |  - ^  ...  ,, - - - -  + 


l  _  e27ra(zfc0o(l“e)-a<7)  ]_  _  g2na{ikoo(l+e)-0ij)  J 


(96) 


Unfortunately,  to  obtain  the  coefficients  aj  and  dj  we  must  look  back  to  the  eigenfunction 
solution. 
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5.3.5  Asymptotics  of  the  eigenfunction  solution 

We  return  to  the  Green’s  function  solution  for  a  point  source  (39).  Write  this  as 


G(x,  x0)  =  iv+£fc ooaell/e 


(97) 


where 


F, 


1  Hi2\k00rmm)Hl(1\k00a)  -  H™  (k^a)  m 

H'W(kooa)  "  {  ° 


8  i 


(98) 


Let  F(v)  be  the  function  which  gives  analytic  continuation  of  Fv  to  all  complex  v.  Then 
by  performing  a  Watson  transform  we  may  write 

i  f 

G(x,x0)  =  --f  — - F(v  + ekood)  dv,  (99) 


2  Jy  sin  V7T 


where  T  is  a  contour  around  the  real  axis.  Exploiting  the  fact  F(v)  =  F(— v)  this  integral 
can  then  be  rewritten  as 


j  roo-\-iS  giv(Q— n) 

G(x,x0)  =  -  /  — : - F(u  +  efcoofl)  + 

2  ./-oo +iS  sin  w 


e-iv{0- 7r) 

— : - F(is  —  ekood)  du. 

Sill  JS7T 


(100) 


where  5  >  0,  with  the  contour  being  just  above  the  real  axis.  Consider  the  integral 
defined  by 

/•oo*fi<5  „±zV(0— 7r) 

— - - Flu  ±  (.hood)  di/.  (101) 

-oo+i<5  sin  ™ 

We  close  the  contour  in  the  upper  half  plane.  Note  that  this  can  only  be  done  in  the 
diffracted  region.  The  only  contribution  to  the  integral  comes  from  residues  in  the  upper 
half  plane.  We  get  residue  contributions  wherever  H'~1\kOQa)  has  zeros,  where  v  =  v^zek^a. 
For  large  P,  kooQ ;  the  zeros  of  H^(k ood)  are  given  by 


Vj  ~  koed 


1/3 


ei7r/3<?', 


(102) 


where  q'-  are  the  roots  of  the  derivative  of  the  Airy  function,  Ai '(</■)  =  0.  Hence 


vf  =  Vj  T  ekocQ.  ~  A*ooa(l  ^  f)  - 


,  \  1/3 

K°°°  I  p^/3r/ 
^3  _r  cnoou  noc“Aa  “T  ^  2  I  6  Vj* 

The  poles  are  in  the  upper  half  plane,  and  are  all  simple  so  we  find 


/±  - 


_ ±iuf(6— 7r)  (k  a) 

-7  E 


4  '  sin  i/^7r  "7 


The  factors  in  front  can  be  rewritten  as 
+.•«/+(*-*)  2icvSe 


(6>-7r)  2ie*^'  (27r~e) 


sin  i^+  7r 


e2™5  -1 


sin  I/.  7T 


e27ril5_  -  1 


(103) 


(104) 


(105) 
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Since  v  ~  k^a,  for  large  v,  k^r  the  Hankel  function  has  asymptotic  form 


H${k oor) 


nkooVr2  —  Q? 


pik00y/r2—Q^—iPj  cos"1  z7r/4 


(106) 


and  H^(/cooro)  will  have  a  similar  asymptotic  expression.  It  can  be  shown  that 

^2)(fe  qqQ)  _  e5t7r/6  /WA173 

$ jHpjHkooa)  2n(-q'j)  ^Ai(g')^  V  2  / 


(107) 


Combining  all  these  expressions,  and  comparing  with  GTD  solution  (96)  eventually  yields 
the  diffraction  coefficients  as 


p5i7r/6 


Oij  — 


a 


_ e27r/24 

lj=  (27r)V4(-5')i/2|Ai(^)| 


(108) 

(109) 


Note  that  q'3  is  real  and  negative,  and  Ai(g')  is  real. 


5.4  The  diffracted  field  of  an  incident  plane  wave 

Now  the  diffraction  coefficients  have  been  found,  the  case  of  an  incident  plane  wave  can  be 
solved  by  a  similar  construction.  The  corresponding  solution  is 


d ? 


Jkoo  Vr2-a 2  -  (ifcoo  -  aj )  a  (  7t/2+cos  1 


^  (koo\/r2  -  a2)1/2 

g(^oo(l-e)_Q:j)a^  e(i/coo(H-e)-aj)a(27r-0) 


+ 


J  __  e27ra(ifcoo(l— c)— J  _  e27ra(t/cco(l-f  e)-^) 


(110) 


6  Fresnel  Region 

We  return  to  our  governing  equation 


n  ,  9  ,  Ziekocd  d'tb 

vV  +  £  =  o. 


86 


Note  the  substitution  =  <pe  iek°°ad  yields  simply 


V  V  +  k^ip  =  0, 


(111) 


(112) 


the  Helmholtz  equation  for  tp  to  0(e2).  This  reflects  what  we  have  been  seeing  in  our  ray 
calculations  so  far:  the  effect  of  the  O(e)  mean  flow  is  simply  a  phase  shift  determined  by 
the  angular  procession  of  rays  around  the  cylinder. 
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Shadow  boundary 


Geometric  Optics 


Figure  7:  Cartoon  of  the  different  asymptotic  regions.  There  is  a  further  asymptotic  region 
in  the  neighbourhood  of  the  point  at  which  the  grazing  ray  hits  (the  Fock-Leontovich  region) 
which  we  have  not  solved  for. 

At  the  shadow  boundary  the  geometric  optics  field  is  discontinuous.  Moreover,  the  GTD 
solution  blows  up  at  the  shadow  boundary.  On  the  shadow  boundary  the  solution  takes  the 
form  of  a  Fresnel  integral  which  we  now  derive  (Figure  7).  Solving  for  the  Fresnel  region  is 
equivalent  to  solving  for  the  field  due  to  an  incident  wave  past  a  flat  screen.  It  is  important 
to  note  that  the  Fresnel  solution  has  no  knowledge  of  the  curvature  of  the  surface. 

Consider  the  ray  which  hits  the  top  of  the  cylinder  defining  the  shadow  boundary.  Far 
above  this  shadow  boundary  the  field  is  dominated  by  the  incident  field,  which  takes  the 
form  'ij>1  =  Q-^oox-ickocaO  rpj1js  motivates  searching  for  solutions  to  (111)  of  the  form 

(j)  —  ve-ikooX-itkooaO 

along  the  shadow  boundary.  We  are  solving  in  the  left  half  plane  for  a  wave  coming  from 
the  top,  so  0  must  be  defined  so  that  7r/2  <  6  <  3n/2.  Substituting  in  to  (111)  yields 

^ XX  ^ikoQVx  T  Vyy  =  0.  (114) 

1  /2 

Introduce  boundary  layer  variables  x  '  =  - x ,  y'  =  hob  (y  —  a).  This  boundary  layer  scaling 
implies  that  far  away  enough  from  the  cylinder  the  Fresnel  regions  fill  in  the  shadow  region. 
(114)  becomes 

Vx/x/  T  H-  koQVyfyf  —  0.  (115) 

Hence  expanding  for  large  k ^  we  find  the  leading  order  term  is  given  by 

2  iV()X'  +  V0  ylyl  =0.  (116) 

This  is  the  paraxial  wave  equation.  We  introduce  a  similarity  variable  r]  =  yf and  let 
Vo  —  /(??).  The  above  then  reduces  to 

=  0.  (117) 
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As  we  go  out  from  the  boundary  layer  we  want  the  solution  to  match  on  to  the  incident 
field  for  y  >  a  and  to  go  to  zero  for  y  <  a  in  the  shadow  region.  This  allows  us  to  give  the 


(118) 


(119) 


(120) 

where  it  is  important  to  note  that  9  is  defined  in  this  expression  so  that  — 37r/2  <  6  <  — 7t/2. 

We  now  have  three  asymptotic  expansions  which  are  valid  in  three  different  regions: 
the  geometric  optics  solution  is  valid  outside  the  shadow  region,  the  GTD  solution  is  valid 
inside  the  shadow  region,  and  the  Fresnel  solution  is  valid  in  a  neighbourhood  of  the  shadow 
boundary.  From  these  we  can  construct  a  uniformly  valid  solution  by  forming  a  composite 
expansion,  and  such  a  solution  is  plotted  in  Figures  8  and  9. 


final  solution  for  the  top  Fresnel  region  as 


w,/  _  -—ikooX—iekooaO  (\  +  e  ™/4  ^ (y  ~  g) 

^t0P_e  \2  +  y/2  [ 


where  7t/2  <  0  <  Sir/ 2  and  Fr  is  the  Fresnel  integral  defined  by 


Fr(*)=  />V2d, 

Jo 


A  similar  derivation  for  the  bottom  shadow  boundary  leads  to 


f  _  n-ik^x-iek^ae  (\,  e  m/4  pv  (  fc^2(~«  ~  V) 


bot 


-  +  _  Fr 

2-  V2 


i/— 7 XX 


Figure  8:  Plot  of  potential  4>  with  e  =  0,  k^a  =10.  Left  picture  shows  the  eigenfunction 
solution,  right  the  ray  tracing  solution.  The  two  plots  are  remarkably  similar,  demonstrating 
the  effectiveness  of  the  ray  tracing  approximation.  In  the  ray  tracing  solution  there  is  a 
small  numerical  problem  along  the  shadow  boundary. 
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Figure  9:  Plot  of  potential  (j)  with  e  =  0.25  (left  picture)  and  e  =  0.3  (right  picture)  and 
fcoott  =  10.  Notice  that  refraction  causes  wavefronts  to  be  out  of  phase  on  the  left  of  the 
cylinder  on  the  left  picture,  but  in  phase  on  the  right  picture. 

7  Force  on  the  cylinder 

We  arc  interested  in  calculating  the  time  averaged  force  F  on  the  cylinder.  This  is  simply 
given  by  integrating  the  time  averaged  pressure  around  the  cylinder 

F  =  —  J  pn  d S  =  —  j  p{ cos  0,  sin  6)a,  d 9.  (121) 

where  p  is  given  by  (28).  The  previous  sections  were  devoted  to  solving  for  (j)[ ,  but  to  find 
the  force  at  second  order  in  wave  amplitude  r/  we  must  also  specify  <j> 2.  To  the  order  we 
are  concerned  with  <fi 2  is  the  solution  to  Laplace’s  equation.  The  boundary  condition  on 
r  =  a  is  simply  IfJ  •  n  =  0,  but  the  question  of  what  boundary  condition  to  apply  at  00  is 
harder  to  answer.  We  could  apply  U2  =  0  at  00,  i.e.  no  Eulerian  mean  velocity  at  second 
order.  However,  this  would  imply  that  there  is  still  an  0(ry2)  mass  flux  past  the  cylinder 
given  by  the  Lagrangian  mean  velocity  is  non-zero.  This  phenomenon  of  net  mass 

flux  is  known  as  Stokes  drift.  A  perfectly  acceptable  alternative  way  of  setting  the  problem 
up  would  be  to  demand  no  mass  flux  past  the  cylinder,  i.e.  no  Lagrangian  mean  velocity  at 
second  order.  There  is  no  single  “right  way”  of  choosing  the  boundary  condition  on  U2  at 
00.  Once  we  are  given  the  constant  velocity  at  infinity,  U2  ~  172oo(cos  o,  sin  a)  as  r  — ►  00, 
Laplace’s  equation  has  the  classical  solution 

02  =  (r  +  -0  cos  (9  -  O') .  (122) 
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7.1  Force  calculation  by  ray  tracing 

It  is  straightforward  to  calculate  the  force  from  the  ray  tracing  solution.  The  dominant 
contribution  to  ip  comes  from  the  side  of  the  cylinder  on  which  the  wave  is  incident  (“the 
bright  side”),  and  there  the  field  is  given  by  geometric  optics  as  the  sum  of  the  reflected  field 
and  the  incident  field.  On  the  cylinder  the  reflected  field  has  the  same  phase  and  amplitude 
as  the  incident  field.  Hence  on  the  bright  side  of  the  cylinder 

iP  =  2—e'ik^cose+e0\  (123) 

LJ 


where  we  have  inserted  c2/cj,  an  appropriate  dimensional  factor.  We  calculate  each  term  in 
the  time  averaged  pressure  (28)  from  (122)  and  (123): 


1 

2c2 


deft 

dt 


iy  =  j_Re  WMb 

S  )  4c2  \  dt  dt  ) 


U) 

4c2 


1  d<p  i- 

c2  dt 


-U  •  V02  =  2ecu2oo  sin  (9  -  a) , 


W, 

(124) 

>|2  =  c2  (—  sin2  6  +  2e  sin  6 ) , 

(125) 

Re(iw^*U  •  Wip)  =  — 2c2esin0, 

(126) 

(127) 

where  we  have  used  the  identity  AB  =  Re(,4*i?)/2.  Thus  the  time  averaged  pressure  on 
the  bright  side  of  the  cylinder  is 


p  =  constant  +  p2Hc2  ^cos2  6  +  2e~ —  sin  ( 6  —  a)J  +  0(??3),  —tt/2  <  6  <  n/2.  (128) 

On  the  dark  side  of  the  cylinder  there  is  no  leading  order  contribution  from  the  linear  waves, 
but  there  is  still  a  contribution  from  the  U  *  Vep2  term: 


p  =  constant  +  rj2Hc2 


0  +  2e^^  sin  (6  —  a)^  +  0(t?3), 


—  71  <  6  <  —  7r/2, 
7t/2  <  6  <  7T. 


(129) 


Integrating  the  time  averaged  pressure  around  the  cylinder  we  find  the  0(r?2)  time  averaged 
force  is  given  by 

F  =  —J  p(cos  6,  sin  0)a  d9  =  p2H c2a  ^(—4/3,0)  —  2e7r^^(--  sin  a,  cos  a)^  ,  (130) 

or  in  terms  of  the  circulation  T  =  27 reca, 

F  =  7]2  (Hc2a(-4/ 3, 0)  -  Hu^T(-  sin  a,  cos  a)) ,  (131) 

where  the  last  term  can  be  recognised  as  the  usual  expression  for  the  Magnus  force  due  to 
flow  past  a  cylinder. 
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7.2  Force  calculation  by  eigenfunction  solution 

We  can  also  calculate  the  force  using  the  eigenfunction  solution  (48).  After  some  algebra, 
we  find 

F  =  rj1  (i7c2a(S(A:00a),  0)  -  Hu^T(- sin  a,  cos  a)) ,  (132) 

where  S(z)  is  a  real  valued  function  defined  by 


S  (z) 


2  i 


7T  Z 


Ez^  —  777-  {rn,  —  1) 

t/'O) ( „\  rj'(2)  ( ~\ 
me  1  \*)nm.-\ \z) 


(133) 


Note  that  S(z)  is  independent  of  e  so  that  the  only  0(e)  contribution  to  the  force  is  still  the 
Magnus  force  term.  Also,  by  comparison  with  the  ray  tracing  solution  we  have  S(z)  — >  —4/3 
as  2  — »  oo.  The  form  of  S(z)  agrees  with  a  similar  calculation  for  the  acoustic  force  on  an 
elastic  cylinder  given  by  [4]. 


8  Conclusions 

To  0(e)  we  have  found  expressions  for  the  field  resulting  from  the  scattering  useful  for 
both  small  k^a  (the  eigenfunction  solution)  and  for  large  k^a  (the  ray  tracing  solution). 
The  effect  of  the  0(e)  circulation  is  simply  to  add  phase  shifts  in  appropriate  places  in  the 
calculation,  and  importantly  it  does  not  change  the  diffraction  coefficients. 

The  0(e)  force  has  been  calculated.  It  is  important  to  note  that  we  now  have  to  specify 
parts  of  the  0(??2)  problem  which  we  didn’t  have  to  consider  in  the  no  mean  flow  case. 
However,  the  only  0(e)  contribution  to  the  force  turns  out  to  be  a  Magnus  force  due  to  the 
mean  Eulerian  flow  past  the  cylinder. 

To  solve  the  0(e2)  irrotational  problem  requires  a  lot  more  work.  Firstly  the  governing 
partial  differential  equation  is  no  longer  separable.  When  calculating  diffraction  coefficients 
comparison  with  the  eigenfunction  solution  was  essential,  and  so  this  is  an  important  stum¬ 
bling  block  for  application  of  the  geometrical  theory  of  diffraction.  To  0(c2)  the  rays  are 
no  longer  straight,  and  so  also  we  lose  a  lot  of  the  geometrical  considerations  which  made 
the  0(e)  mean  flow  problem  so  similar  to  the  no  mean  flow  problem. 

A  probably  tractable  generalisation  of  this  problem  would  be  to  look  at  the  case  of  a 
rotational  mean  flow  at  0(e).  Here  the  rays  are  still  no  longer  straight,  but  the  ray  curvature 
can  be  expressed  simply  in  terms  of  the  vorticity  of  the  mean  flow  [3,  6]. 
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1  Introduction 

Chemical  reactions  are  very  often  modelled  by  ordinary  differential  equations,  where  the 
concentration  of  a  particular  particle  evolves  according  to  a  deterministic  law.  But,  in  order 
to  be  able  to  answer  several  questions  where  the  discreteness,  hence  the  intrinsic  fluctua¬ 
tions  play  a  role,  one  needs  to  describe  the  system  by  a  stochastic  model.  We  assume 
the  chemical  system  to  be  well-stirred,  so  that  all  the  particles  are  distributed  in  space 
appropriately  uniformly.  Also  we  assume  the  number  of  reactions  occurred  to  be  Poisson- 
distributed,  i.e.  the  waiting  times  between  different  reactions  are  distributed  exponentially 
with  given  rates.  This  is,  in  a  nutshell,  the  essence  of  the  Kinetic  Monte  Carlo  (KMC)  mod¬ 
els  of  chemical  reactions.  These  models  are  discrete  and  non-deterministic,  as  opposed  to 
the  deterministic,  continuous  models  governed  by  Ordinary  Differential  Equations  (ODE), 
and  the  non-deterministic:,  continuous  ones,  governed  by  Stochastic  Differential  Equations 
(SDE).  The  last  two  models,  are  simpler  and  can  be  obtained  from  the  KMC  in  certain 
limits. 

The  drawbacks  of  the  deterministic  description  are  well-known.  It  leads  to  no  fluctua¬ 
tions,  a  very  important  characteristics  in  certain  cases.  The  model  governed  by  SDE  is  the 
so-called  diffusion  approximation  of  the  original  system.  It  is  a  non-deterministic  model, 
hence  it  detects  the  intrinsic  fluctuations.  However,  it  may  not  be  good  enough,  if  we  are 
concerned  in  the  exponentially  large  (small)  variables  and/or  we  deal  with  exponentially 
unlikely  events.  We  will  consider  the  Schlogl  model  as  the  simplest  model  with  bistability 
(two  stable  equilibriums  predicted  by  the  deterministic  description)  and,  as  an  example  of 
an  exponentially  large  observable,  we  will  consider  the  switching  times  between  two  stable 
states.  On  that  example  it  will  become  clear  why  the  diffusion  approximation  governed  by 
SDE  is  not  satisfactory. 

This  report  is  organized  as  follows:  first,  in  Sec.  2  the  Schlogl  model  is  introduced  as  a 
deterministic  one.  Next,  in  Sec.  3,  we  will  introduce  the  KMC  model,  where  the  number 
of  particles  evolves  as  a  Markov  jump  process.  In  the  Sec.  4  we  will  show  how  to  get  the 
simpler  descriptions  (ODE  and  SDE)  as  large  system  volume  limits  of  the  Markov  jump 
process.  Also,  in  order  two  motivate  the  use  of  the  exponentially  large  observables,  we  will 
introduce  the  simplest  theorem  in  the  large  deviations  theory,  in  Sec.  5.  Afterwards,  in  Sec. 
6,  we  will  apply  the  introduced  ideas  to  the  Schlogl  model,  emphasizing  the  calculations  of 
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the  mean  first  passage  times,  that  are  exponentially  large,,  hence  reveal  the  the  drawbacks 
of  the  diffusion  approximation.  Finally,  the  Sec.  7  will  be  devoted  to  a  short  discussion  and 
conclusions,  as  well  as  future  work. 

2  The  Schlogl  model  and  its  deterministic  description 

Consider  the  following  chemical  reaction,  introduced  by  Schlogl  [1]  as  a  catalysis  model: 


2V  +  A  3X 

h 


X^B. 

}C4 


(1) 


We  denote  the  number  of  particles  X ,  A  and  B  by  the  corresponding  letters.  Then,  keeping 
A  and  B  fixed  and  of  the  order  of  the  system  volume  V,  we  are  interested  in  the  evolution 
of  X .  ki  are  the  rates  of  the  corresponding  reactions  in  (1). 

The  simplest  description  of  the  system  is  the  deterministic  one,  where  the  concentration 
x  =  X/V  is  a  deterministic,  continuous  variable,  time  evolution  of  which  is  governed  by  the 
ODE: 


=  «(*)-«*(*).  (2) 

Here  we  denoted  u(x)  —  k\x 2  +  k 4  and  d(x)  =  k%xs  +  k$x. 

For  appropriate  choice  of  parameters,  the  function  f(x)  —  u(x)  —  d(x)  has  three  real 
roots:  the  middle  one  corresponds  to  the  unstable  equilibrium,  while  two  others  are  the 
stable  equilibrium  values.  See  Fig.  1.  Hence,  depending  on  the  initial  value,  x  will  expo¬ 
nentially  approach  to  one  of  the  two  stable  equilibriums,  see  Fig.  2.  In  a  certain  sense,  the 
Schlogl  model  is  the  simplest  one  that  leads  to  the  bistability.  Third  order  polynomial  is 
normally  the  first  choice,  if  we  want  to  have  a  function  of  three  real  roots. 

Although  the  deterministic  model  is  simple  to  analyze,  it  is  a  good  approximation  to 
the  real,  discrete  system  only  in  the  limit  of  large  volume  and  for  finite  time  intervals  [2,  3]. 
It  is  not  able  to  answer  to  questions  related  to  the  stochastic  behaviour  of  the  system, 
namely,  the  intrinsic  fluctuations  of  the  system  are  not  detected.  Consequently,  it  leaves 
open  the  question  of  relative  stability:  near  which  of  the  two  stable  equilibrium  states  we 
are  more  likely  to  find  the  system  at  a  randomly  chosen  time?  The  system  spends  most 
of  the  time  near  one  of  the  equilibrium  states,  but  fluctuations  can  sporadically  drive  it 
to  the  neighborhood  of  the  other  equilibrium.  In  order  to  analyze  the  intrinsic  fluctuative 
behaviour  of  the  system,  we  will  next  introduce  the  discrete  stochastic  model  (Kinetic 
Monte-Carlo  scheme)  according  to  which  the  chemical  reaction  happens. 
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3  Discrete  model  as  a  Markov  jump  process  (Kinetic  Monte 
Carlo) 

The  reaction  (1)  can  be  rewritten  as: 


XU^X  +  l 


xDSx-i. 


(3) 


Here  the  rates  of  moving  “up”  and  “down”  are,  correspondingly, 

U(X)  =  yX(X  -  1)  +  k4V  and  D{X)  =  ^ X(X  -  1)(X  -  2)  +  k3X.  (4) 

Recall  that,  without  loss  of  generality,  we  absorbed  A  and  B  into  the  system  volume  V . 

Hence,  our  model  is  one-step  Markov  jump  process  (see  [4])  and  is  governed  by  the 
following  Master  Equation  (forward  Kolmogorov  equation)  for  the  probabilities  F(n,t)  = 
P  {X(i)  =  n}: 

8P^  ^  =  U(n  -  1  )P(n  -  1,  t)  +  D(n  +  1  )P{n  +  1  ,t)  -  ( U(n )  +  D{n))P(n,  t )  = 

=  (C*P)(n,t ),  (5) 

where  £*  is  the  adjoint  of  the  generator  C  of  the  process.  The  generator  of  a  Markov  process 
X(t),  by  definition,  is  the  operator 

(£. :/)(„)  =  Jim  ,  (6) 

which  for  one-step  Markov  jump  processes  takes  the  form 

(£/)(n)  =  U(n)[f(n  +  1)  -  /(n)]  +  D{n)[f{n  -  1)  -  /(»)].  (7) 

The  generator  defines  the  Markov  process  completely.  Also,  if  we  define 

v(n,t)  =  E[f(X(t))\X(0)  =  n}  (8) 

for  any  observable  /(n),  then  v(n,t)  solves  the  partial  differential  equation  (backward  Kol¬ 
mogorov  equation) 

vt  =  Cv  (9) 

with  initial  condition  t?(n,0)  =  /(n),  see  [5]. 


4  Large  system  volume  limits 

In  order  to  analyze  large  system  size  limits,  let  us  pass  to  intensive  variable  x  =  n/V  =  en . 
In  terms  of  £,  the  generator  has  the  form 

(Cef)(x)  =  e~V(x)[/(x  +  e)  -  /(z)]  +  e~ldt(x)[f(x  -  e)  -  f{x)}.  (10) 
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Here 


ut(x)  =  eU(x/e)  =  /cix(x  —  e)  +  £4  =  i/,(x)  +  eui(x) 


and 

dt(x)  =  eD(x/e)  =  /c2 x(x  -  e)(x  —  2e)  +  k$x  =  d(x)  +  edj(x)  +  (9(e2). 

Hence,  the  leading  order  rates  are,  correspondingly,  u(x)  =  Aqx2  +  /c4  and  d{x)  =  fc2X3  +  /c3X. 

The  Markov  process  corresponding  to  the  generator  (10)  is  the  same  as  in  the  original 
model,  only  with  the  rates  and  jump  sizes  rescaled  by  e.  Instead  of  working  with  the  Master 
equation,  we  will  explore  the  generator  Ct  itself,  i.e.  the  backward  form 

Vt  =  Cev.  (11) 

Expanding  the  generator  (10)  in  the  small  e  limit  leads  to 

(A/)A)  =  [«(*)  -  d(x)]f'(x)  +  e[ui{x)  -  di(x))f(x)  +  |  [u(x)  +  d(x)]f"{x)  +  o(e2). 

To  the  leading  order,  we  get  the  differential  operator 

(A )f)(x)  =  [«(z)  -  d(x))]f{x)  (12) 

corresponding  to  the  deterministic  description  (2),  discussed  in  Sec.  2. 

To  the  next  order  O(e)  we  obtain  the  backward  form  of  the  Fokker-Planck  equation  (see 

[6]) 


(£fp/)(s)  =  [«(•*)  -  d(x)]f'{x)  +  e[u](.r)  -  dj  (x))f(x)  +  |  [n(.x)  +  d(x)]f"(x),  (13) 

which  leads  to  a  stochastic  differential  equation  for  x(t): 

d/x  =  [?i(.r)  —  d(x)  +  e(u\(x)  —  di(x))}dt  +  \/e(u(x)  +  d,(x))d,W.  (14) 

This  corresponds  to  the  diffusion  approximation  of  the  process,  with  drift  u(x)  —  d(x)  + 
e(u\(x)  —  d\(x))  and  diffusion  e[u(x)  +  d(x)]/ 2. 

In  contrast  with  the  deterministic  description,  the  diffusion  approximation  takes  the 
fluctuations  into  account,  but  it  is  not  good  enough  if  we  are  dealing  with  rare  events,  that 
arise  when  we  calculate  exponentially  large  (small)  observables.  In  the  next  section  we  will 
introduce  basic  large  deviations  ideas,  in  order  to  qualify  “rare  events”  in  a  more  formal 
way. 

5  Large  deviations  principles 

5.1  Large  deviations  principle  for  random  variables 

Let  us  start  off  with  the  large  deviations  principle  for  the  random  variables.  Suppose  we  have 
independent,  identically  distributed  random  variables  xt  with  the  common  mean  Ex*  =  m 
and  moment  generating  function  M(6)  =  E  e°Xi. 
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The  well-known  law  of  large  numbers  states  that  the  average  of  these  variables  goes  to 
m  in  probability  as  the  ensemble  size  goes  to  infinity: 


X\  +  ...  +  xn  p 


m 


as 


n 


n  — >  oo, 


hence,  for  any  a  >  m, 


f  x\  +  ...  +  xn  1 

| - n - >«j^0 


as 


n  — >  oo. 


(15) 


(16) 


The  natural  question  arises:  what  is  the  convergence  rate  in  (16)?  Since  e0x  is  a  monotone 
function,  we  have 

P  {Xl  +  ...  +  Xn>na}  =  P  |e«(*>+-+*»)  >  e8na }  <  e~6naE  c»(*i+-+*»).  (17) 

The  last  step  is  just  an  application  of  the  Chebyshev’s  inequality.  By  independence,  we  get 
P  {xi  +  ...  +  xn  >  no]  <  e-6naM{0)n  =  e-<6a-\oZM{e))  for  all  e  (18) 

Since  (18)  works  for  all  6  we  can  define  the  action  (rate)  function  1(a)  =  sup{#a  —  log  M (0)} 

e 

to  obtain  a  stronger  inequality: 


P  {x\  +  ...  +  xn  >  na}  <  e 


-nl(a) 


(19) 


The  basic  theorem  in  the  large  deviations  theory  states  that  the  bound  in  (19)  is  sharp, 
namely, 

X\  +  ...  +  xr 


1  1  td  /  Xl  +  Xn  ^  1 

-  log  P  < - >  a  > 

n  {  n  ) 


-1(a) 


(20) 


as  n  — >  oc.  See,  say,  [7,  8].  We  say  that  the  exponentially  unlikely  event  {x\  +  ...  +  xn  >  na} 
satisfies  the  large  deviations  principle  with  the  action  function  1(a). 


5.2  Large  deviations  expansion 

We  will  look  at  an  observable  v(x,t)  =  E[/ (X (t))\X (0)  =  x]  with  f(x)  =  eg^x^e  for  some 
function  g(x).  This  motivates  the  ansatz  v(x,  t)  =  We  plug  it  into  (11)  and  expand 

the  nearby  values  of  the  exponent  <fi(x,t)  to  obtain,  in  the  highest  order  e-1,  the  partial 
differential  equation 

<f)t  =  u(x)(e^x  —  1)  +  d(x)(e~'^x  —  1)  (21) 

with  initial  condition  (f>(x:0)  =  g(x ). 

^From  the  other  hand,  if  we  used  the  diffusion  approximation  with  its  generator  Cfp 
before  applying  the  WKB  ansatz  above,  then  we  would  get,  again  in  the  leading  order  c-1, 
a  wrong  PDE  for  the  exponent  function  <£(#,<),  namely, 

<h  =  \u(x)  -  d(x)](j>x  +  (22) 
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Therefore,  if  we  are  interested  in  exponentially  large  observables  with  the  exponent  of 
0(e_1)  =  0(V ),  then  the  diffusion  approximation,  hence  the  corresponding  SDE  (14)  lead 
to  a  systematic  error  in  the  exponent  function  <j>{x ,  t). 

We  may  go  to  the  next  order  by  taking  0(x,i)  =  4>o(x,t)  +  e<f>i (x, <).  Then  4>o(x,t ) 
satisfies  (21),  whereas  cj)\(x,t)  can  be  expressed  as  4>i(x,t)  =  In  z(x,t)  with  z(x,  t)  satisfying 

zt  —  zx{u{x)e(<f)o)r  +  d(x)e~^x)  +  *  (u(x)e^x  —  d,(x)e~^x) ,  (23) 

which  can  be  solved,  as  soon  as  (f)o(x,t )  is  found  explicitely. 

5.3  Moderate  deviations  expansion 

Let  us  now  look  at  an  observable  of  the  form  f(x)  —  e with  1  >  (S  >  e,  so  we  are 
dealing  with  moderate  deviations ,  as  opposed  to  the  large  deviations,  where  we  had  5  =  c. 
One  can  think  of  6  —  eQ  with  0  <  a  <  1.  This  will  motivate  the  moderate  deviation 
an  sat  z  v(x,t)  =  which,  with  both  the  correct  generator  Cc  and  the  Fokker-Planck 

generator  Cpp  in  the  backward  equation,  leads,  in  the  first  two  orders,  to  the  same  equation 
for  the  exponent  function  t) 


+  (w(*x)  ~~  d(x))4>x  + 


u(x)  +  d(x) 


= 0 


(24) 


with  initial  condition  </;(x,  0)  =  g(x),  as  before. 

Therefore,  we  can  claim  that  the  diffusion  approximation  is  good  enough  in  describing 
up  to  moderate  deviation  events.  At  least,  it  gives  the  correct  exponent  (action)  function. 


6  Solving  for  the  action  function 


6.1  Classical  mechanics  interpretation 

We  will  focus  on  the  large  deviations  case  v(x ,  t)  =  e^x,t^e  leading  to  the  equation  for  the 
action  function  (j){xA) 

(j)t  —  u(x)(e^T  —  1)  +  d(x)(e^x  —  1)  (25) 

with  initial  condition  </;(#, 0)  =  g(x). 

Notice  that  this  equation  is  of  the  Hamilton- Jacobi  form  <pt  +  H(x,4>x)  =  0,  hence 
can  be  solved  by  the  method  of  characteristics.  The  (x,  £)-plane  is  being  covered  by  the 
characteristics  (rays),  and  the  evolution  of  0(x,t),  as  well  as  x,  is  tracked  along  these 
characteristics  according  to  a  system  of  ODE,  see  [9,  10]. 

First,  we  read  off  the  formal  Hamiltonian  H(x,p)  =  u(x)(ep  —  l)  +  d(x)(e~p  —  1),  and  the 
momentum  is  introduced  by  p  =  (f)x  The  corresponding  Lagrangian  can  also  be  calculated: 


dx  f  + 
^l0g~ 


\[d  )2  +  4u(x)d(x) 

2«(x)  _ 


+u(x)+d{x) 


2 

+  4't/,(x)d(x).  (26) 
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Figure  3:  The  phase  portrait  of  the  Hamiltonian  system  corresponding  to  the  Hamilton- 
Jacobi  equations  for  the  action  function  </>(#,  t). 


The  system  evolves  along  the  characteristics  H  =  const  according  to  the  Hamiltonian 
system  of  ODE 


<  §  =  u(x)ep  -  d(x)e~P 


v  S  =  -u'(x){ep  -  1)  -  d'(x)(e  p  -  1) 


while  <f>(Xjt)  evolves  by 


dx 


d(j)  dx 

Tt  + 


-H(x,l >)+P^  = 


=  L  —  (x,p) J  =  u(x)(pep  ~  ep  +  1)  —  d(x)(pe  p  +  e  p  —  1). 

Also,  by  Hamilton’s  principle,  <p{x,t)  solves  the  variational  problem 


4>(x 


■t,=inf{/ 


L(xy  dx/ds)ds  +  g(x(  0)) 


}■ 


(27) 


(28) 


(29) 


where  the  infimum  is  taken  over  all  C1  [0,  t]  functions  x(-)  with  x(t)  =  x. 


6.2  Switching  times  in  the  Schlogl  model 

For  the  Schlogl  model,  H  =  const  paths  are  shown  in  Fig.  3.  Using  the  phase  portrait  of  the 
corresponding  Hamiltonian  system,  the  solution  <f>(X,  T )  of  Hamilton-Jacobi  equation  with 
an  initial  condition  <p(x,  0)  =  g{x)  is  obtained  in  the  following  way.  We  take  the  vertical 
line  x  =  X  and  trace  all  the  points  backward  for  time  T.  That  will  give  an  “initial”  profile 
f(x,p)  =  0  of  the  points  that  lead  to  X  at  time  T.  The  intersection  of  this  profile  with  the 
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real  initial  curve  p  =  g'(x)  will  give  the  initial  value  (in  fact,  there  could  be  more  than  one 
intersection  points),  therefore,  the  correct  characteristics,  leading  to  the  point  X  in  time  T. 
This  is  the  essence  of  the  “shooting”  method  [9]. 

As  mentioned  in  Sec.  2,  one  of  the  most  important  questions  for  bistable  systems  is: 
which  of  the  stable  states  is  more  stable?  To  answer  it,  we  should  compare  the  mean  first 
passage  times  (called  switching  times)  from  one  stable  state  to  the  other. 

Mean  first  passage  time  T(x)  from  a  state  a:  to  a  fixed  state  Xf  solves  the  backward 
equation  —1  =  £T(x)  with  the  appropriate  boundary  conditions  [4,  5].  For  the  one-step 
jump  Markov  processes  the  mean  first  passage  times  can  be  calculated  exactly  from  the 
backward  master  equation  and  it  leads  to  the  large  deviations  asymptotics  of  the  form  e^x^e 
for  some  action  function  <j)(x ).  In  general,  deterministically  forbidden  switches  between  two 
states  (in  the  deterministic  case,  see  Fig.  2,  switches  that  have  to  pass  through  x  =  2 
are  not  allowed)  are  large  deviation  events.  The  large  deviation  analysis  applied  to  the 
—  1  =  £T(x)  (as  opposed  to  (9),  discussed  above)  now  will  lead  to  the  time-independent 
Hamilton- Jacobi  equation 

0  =  u(x)(e^3'  —  1)  +  d(x)(e~^T  —  1)  =  H(x,(f)x).  (30) 


Therefore,  large  deviation  paths  on  the  phase  portrait  are  the  ones  corresponding  to  H(x,p )  = 
0.  These  are  the  hcteroclinic  connections,  and  they  correspond  to  p  =  0  and  p  =  log^y, 
as  can  be  seen  in  Fig.  4.  From  (27)  one  can  see  that  p  =  0  leads  to  the  deterministic 
description  x  —  u(x)  —  d(.r),  where  the  switches  between  the  first  and  third  equilibria  X\ 
and  .7:3  are  not  allowed  (in  order  to  go  from  one  to  the  other,  the  path  necessarily  leaves 
the  axis  p  =  0).  To  switch  from  x 1  to  7:3 ,  while  staying  on  the  H  =  0  curves,  the  system 
has  to  “climb”  the  non-deterministic  path  .xi-to-.T2  and  then  follow  the  deterministic  one, 

.7; 2- to- 3:3  on  the  x-axis.  Similarly,  the  switch  from  £3  to  xi  has  to  go  through  the  “valley” 
.X3-to-X2  and  then  follow  the  deterministically  allowed  path  on  the  x-axis,  X2-to-xi. 

By  (28),  the  exponent  (action)  <p  13  =  elogT]3  for  the  switching  time  T13  from  x\  to  £3 
can  be  found  by 


3  = 


dx 

'H+pjttd  dt== 


nX2  fX  2  1/  \ 

/  pdx=  /  log  dx  =  the  area  Si,  (31) 
lx ,  Jxi  u(x ) 


since  p  —  0  on  the  second  part  of  the  path  and  77  —  0  throughout  the  whole  path.  Similarly, 


rX2 

<t>3i  =  /  pdx 

dx  3 


f  X2 


log  —X\  dx  —  the  area  52- 
d{x) 


(32) 


For  instance,  from  the  Fig.  4  we  can  see  that  in  this  particular  parameter  regime 
S]  >  S2,  hence  T\ 3  >  T31.  Since  it  takes  longer  to  switch  from  x\  to  £3  than  vice-versa,  we 
can  conclude  that  the  first  equilibrium  is  the  more  stable  one. 

The  important  point  here  is  that  if  we  used  SDE  to  model  the  chemical  reaction  (or, 
equivalently,  if  we  used  the  diffusion  approximation  before  the  large  deviation  ansatz),  it 
would  not  give  the  correct  Hamiltonian,  hence  the  switching  times  would  be  miscalculated. 

Also,  the  parameters  Ay-  can  be  tuned  so  that  the  two  stable  equilibriums  are  equally 
stable,  and  there  are  certain  parameter  regimes  where  the  diffusion  approximation  gives  the 
opposite,  wrong  answer  to  the  relative  stability  question. 
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Figure  4:  The  H  =  0  curves  are  heteroclinic  connections.  They  correspond  to  p  —  0  and 
p  —  log  Parameters  are  picked  so  that  =  i  for  i=l,2,3  are  the  equilibrium  states. 


7  Conclusion  and  future  work 


In  this  report  we  have  introduced  the  Kinetic  Monte  Carlo  modeling  of  chemical  reactions, 
paying  particular  attention  to  the  benchmark  bistable  system  -  the  Schlogl  model.  The 
main  reason  of  using  KMC  is  that  the  simplest,  deterministic  model  of  a  chemical  reaction 
is  not  satisfactory  at  all  as  we  are  interested  in  fluctuations  in  concentrations.  We  have 
also  discussed  the  SDE  approach  to  the  problem  and  have  shown  that  it  is  not  able  to  cor¬ 
rectly  answer  questions  concerning  large  deviation  events/observables.  If  we  are  interested 
in  exponentially  large  observables  (e.g.,  switching  times  between  two  states  that  are  nor 
reachable  from  each  other  in  the  deterministic  case)  with  the  exponent  proportional  to  the 
volume  of  the  system,  then  SDE  approach  gives  a  systematic  error  in  the  exponent  function. 
The  reason  is  hidden  in  (10):  we  need  to  plug  the  exponential  ansatz  into  it  first  and  then 
expand  the  exponent  function,  as  opposed  to  the  SDE  approach,  where  we  expanded  (10) 
to  arrive  to  (13),  and  then  plugged  in  the  exponential  ansatz. 

In  fact,  the  work  can  be  carried  out  for  general  Markov  jump  processes  with  generator 


k 

(£/)(n)  =  A i(n)[/(n  +  e<)  -  /(n)] 
i=l 


and  its  rescaled  version 


(£«/)(*) 


k 

Y e-1Aj (x)[f(x  +  eei)  -  f(x)\, 
i= 1 


(34) 
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where  a  are  the  jump  sizes  (there  are  k  possible  ones)  and  A j(n)  are  the  e-independent 
propensity  functions  (rates)  corresponding  to  these  jumps.  Then  the  deterministic  gen¬ 
erator  is  (Cof)(x)  =  fef=]  Ai (ec)eil  f'(x)  and  the  Fokker-Planck  one  is  {Cppf){ x)  — 


(35) 


Yli-l  \(x)ei 

/'(*)  + 1 

Af(x)e( 

will  be  genera 

ized  as 

=  J>(a:)(ee<**  -  1). 


i=  1 


For  the  particular  model,  the  Schlogl’s  bistable  system,  we  answered  the  question  of 
relative  stability,  pointing  out  again,  that  SDE  approach  does  not  answer  it  correctly,  while 
the  deterministic  description  can  not  address  that  question  at  all.  We  have  found  the  exact 
formulae  for  the  switching  times  from  one  stable  state  to  the  other  and  vice  versa. 

Similar  questions  may  be  posed  for  2D  models.  The  further  work  may  include  exploring 
the  competition  models  from  population  dynamics  in  a  manner  of  KMC.  Here,  the  SDE 
approach  is  even  more  widely  used,  hence  it  is  important  to  understand  that  it  may  not  be 
good  enough  explaining  the  rare  events,  such  as,  in  some  cases,  the  competitive  exclusion 
of  one  species. 
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Simple  Models  with  Cascade  of  Energy  and  Anomalous 

Dissipation 


Ravi  Srinivasan 
July  18,  2006 

1  Introduction 

The  phenomenon  of  turbulence  remains  one  of  modern  physics  greatest  unresolved  chal¬ 
lenges.  Turbulent  fluid  flow  exhibits  an  extraodinarily  complex  structure  which  manifests 
itself  in  a  wide  range  of  length  and  time  scales,  posing  a  significant  problem  in  its  ana¬ 
lytical  study  as  well  as  in  numerical  simulation.  While  many  aspects  of  turbulence  are 
quite  controversial,  it  is  generally  accepted  that  turbulence  is  characterized  by  a  nonlinear 
transfer  of  energy  from  large  length  scales  to  smaller  and  smaller  ones,  wherein  energy  is 
dissipated  at  the  length  scale  of  the  molecular  viscosity  v  [5,  6,  8].  What  can  then  be  said 
about  energy  dissipation  in  the  regime  of  fully-developed  turbulence — that  is,  as  Re  — >  oo, 
or  equivalently,  as  v  — >  0? 

In  his  1949  paper,  Onsager  [7]  made  the  surprising  conjecture  that  turbulent  flow  can 
remain  dissipative  even  in  the  inviscid  limit  By  transferring  energy  to  ever  smaller  scales 
and  gradually  dividing  it  amongst  infinitely  many  degrees  of  freedom,  the  driving  mechanism 
behind  such  “anomalous  dissipation”  is  the  energy  cascade  itself!  Onsager  thus  suggested 
that  the  role  of  viscosity  in  energy  dissipation  is  secondary  to  that  of  the  cascade  process. 
The  purpose  of  this  paper  is  to  present  simple  exactly  solvable  models  which  exhibit  these 
very  features  of  a  cascade  of  energy  and  anomalous  dissipation,  and  to  demonstrate  that 
Onsager’s  conjecture  is  indeed  realizable  within  this  elementary  framework. 

1.1  Energy  Balance  and  Onsager’s  Conjecture 

We  begin  our  discussion  in  the  context  of  the  3D  incompressible  Navier-Stokes  equations 
with  viscosity  v  >  0,  forcing  f,  and  periodic  boundary  conditions 

f  u/  +  (u  ■  V)u  =  -Vp  +  i^Au  +  f,  x  6  D  =  [0,  L/]3 

\v-u  =  0 

where  f(x,i)  is  a  stationary,  homogeneous  forcing  acting  on  large  scales: 

Ef(x,  t)  =  0,  Ef  (x,  t)f  (x;,  t')  =  F  l  X  ^  ^  ”  t;)*  (2) 
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For  our  present  discussion,  let 


f(x,t)  =  a£  (wXi(t)sin^^ +  W^.(t)cos^1jei  (3) 

i= 1  '  ' 

which  satisfies  (2)  with  F(x)  =  ^ a 2  ]P?=1  cos  (27t|x  •  e$|).  Consider  now  the  energy  density 
of  the  system 

m  =  pi i  5Eu2(x’ t)d x  =  pi il  (t'L^ (Tt  J_t u2(x’ •  (4) 

where  the  last  equality  holds  by  ergodicity.  Multiplying  (1)  by  u  and  integrating  by  parts 
gives 


6(t)  =  §E|u|2<ix  =  -"M  L  E|Vu|2<ix  +  £  (5) 

where  £  =  |a2,  the  density  of  the  energy  flux  into  the  system  through  forcing,  appears  by 
Ito’s  formula.  Assuming  the  system  is  in  a  statistical  steady  state  (E(t)  =  0)  then  there 
exists  a  global  energy  balance  between  forcing  and  dissipation  through  viscosity: 

//|Vu|2<ix  =  e  <6) 
To  arrive  at  a  local  energy  balance,  consider  a  dimensional  argument  with  L  =  length 
and  T  =  time.  Since 

[L,]  =  L  |e]  =  p  M  =  ^,  (7) 

the  only  length  scale  which  can  be  derived  from  v  and  e  is  the  viscous  length  scale 


C  =  Ce  4  vl 


(8) 


which  vanishes  in  the  inviscid  limit.  From  a  local  perspective,  energy  that  is  pumped  into 
the  system  at  the  forcing  length  scale  Lj  cascades  to  smaller  and  smaller  scales  and  is 
subsequently  removed  from  the  system  at  the  length  scale  of  the  viscosity  C  (see  Figure  1). 

The  cascade  picture  is  more  readily  observed  in  the  Fourier  space  setting.  The  Fourier 
representation  of  (1)  is 


4*k  =  -iPu±  ^(q  •  Uk_q)uq  -  Hk|2uk  +  /k 


dt 


(9) 


where  Pkj.  =  I  —  is  the  projection  on  the  space  of  divergence-free  velocity  fields  and 


u,k  (t)  =  A  [  u (x,t)elk  xdx 

L3  J[0,L] 3 

fk(t)  =  \  f  f(x,t)elk'xdx. 
L  2  J[0,L]* 
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E(k) 


Figure  1:  Cascade  of  energy  from  forcing  length  scale  Lf  to  viscous  length  scale  lu . 

The  first,  second,  and  third  terms  on  the  right-hand  side  of  (9)  correspond  to  the  mechanism 
of  energy  transfer  between  modes,  dissipation  at  Z”1,  and  energy  input  at  LJ1,  respectively. 
By  Parseval’s  identity,  the  energy  equation  is  then 

W  =  |E^|nk|2 

k  (10) 

=  *EPl<xEE^q  "!’''k-q)('«ik  •  *q)  +  C.C.  “  P  +  £ 

k  q  k 

with  c.c.  denoting  the  complex  conjugate  of  the  first  term.  The  summands  within  the 
energy  transfer  terms  are  commonly  known  as  “triad  interactions”  due  to  the  appearance  of 
coupling  between  the  modes  k,  q,and  k  —  q,  consequently  resulting  in  a  nonlinear  transfer 
of  energy.  A  formal  rearrangement  of  the  sum  gives  that 

;EPkxEE(q  ■  «k-q)(*k  •  Uq)  +  C.c.  =  0,  (11) 

k  q 

implying  a  global  energy  balance  between  forcing  and  viscous  dissipation  analogous  to  (6) 
for  statistical  steady  state  solutions  to  3D  Navier- Stokes: 

i/  ^|k|2E|6k|2  =  e.  (12) 

k 

Is  this  formal  rearrangement  actually  valid?  If  we  presume  the  existence  of  a  steady  state 
solution  to  (1)  in  the  inviscid  limit  (Euler  equation  with  forcing)  then  (11)  is  strikingly 
false!  For  steady  state  solutions  to  Euler,  the  energy  transfer  and  forcing  terms  balance: 

*  E  PkxE  E^q  '  f,'k-q)K  ■  *q)  +  c  c •  =  £•  (13) 

k  q 


194 


The  fact  that  the  sum  in  (13)  does  not  vanish  provides  some  insight  into  the  lack  of  regularity 
of  solutions  to  the  forced  Euler  equation  [3,  4].  In  particular,  since  the  Fourier  coefficients 
of  u  do  not  decay  rapidly  enough  to  allow  absolute  convergence  of  the  sum  we  have  that 
such  solutions  maintain  shocks,  which  allow  for  the  anomalous  dissipation  of  energy.  This 
is  the  heart  of  Onsager’s  conjecture:  In  the  regime  of  fully-developed  turbulence,  steady 
state  solutions  correspond  to  the  most  regular  weak  solutions  of  the  3D  Euler  equation 
that  allow  for  anomalous  dissipation.  In  addition,  Onsager  proposed  that  weak  solutions  of 
Euler  conserve  energy  if  they  are  Holder  continuous  with  exponent  n  greater  than  1/3  [5]. 
In  Fourier  space,  the  Holder  condition  is 

(M) 

k 

so  if  the  previous  sum  is  absolutely  convergent  with  n  >  1/3  then  the  conjecture  gives  that 
the  formal  rearrangement  in  (11)  is  valid  and  energy  is  conserved.  The  sufficiency  of  this 
condition  was  proved  in  1994  by  Constantin  et  al  [2]  but  necessity  still  remains  an  open 
question. 

The  loss  of  regularity  of  steady  solutions  to  forced  3D  Euler  can  be  observed  through  a 
dimensional  analysis  argument.  Define  the  second-order  structure  function 

E(x  —  x', t)  =  E|u(x, t)  -  u(x', £)|2,  (15) 

where  the  homogeneity  of  solutions  u  to  (1)  has  been  used.  Then 

[EM]  =  ~  [x]  =  L.  (16) 

and  there  exists  a  function  T  such  that 

r{E'hx'rf%)-°-  (17> 

Assuming  isotropy, 

e  =  c'£§W§9(MM)  (.8) 

for  some  g.  Since  we  are  interested  in  the  inertial  range  lv  |x|  Lj  with  Lj  fixed, 
we  first  let  lv  — >  0  (that  is,  let  v  — >  0)  and  then  take  |x|  — »  0  to  arrive  at  the  celebrated 
Kolmogorov  two-thirds  law 

E  =  Ce  i  |x|  i ,  (19) 

where  we  have  made  the  assumption  that  lim^_o  <?(£,  v)  exists  and  is  finite.  In 

Fourier  space,  the  previous  display  is  equivalent  to  the  five-thirds  law 

£k~£5|k|-i  (20) 

with  =  E^kl2*  Using  (19)  and  Holder’s  inequality,  one  finds  that  the  velocity  field  is 
Holder  continuous  with  exponent  1/3: 
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E|u(x,  t)  -  u(x',  i)|  <  y/E(x  —  x',  t)  —  0(|x  —  x'js).  (21) 

Formally, 

E| Vu(x)|2  =  ^ lim^  j-—p-E| u(x  +  y)  -  u(x)|2  ~  |y |— t  ~  ~  v~l  (22) 

where  we  have  used  the  two-thirds  law  and  concerned  ourselves  with  the  regularity  of  u  at 
the  level  of  the  viscous  length  scale.  It  can  then  be  seen  that 

"isiXE|vu|2rfx~o(i)- 

Alternatively,  since  (6)  is  valid  for  all  v  >  0,  we  have  that  lim^o^py  /QE|Vu|2dx  =  € * 
Even  in  the  inviscid  limit  energy  is  still  removed  by  loss  of  regularity  of  solutions  to  Euler’s 
equation! 

1.2  Anomalous  Dissipation  in  Burgers’  Equation 

In  our  discussion  to  present  we  have  made  several  significant  assumptions,  such  as  that  of 
the  existence  of  a  unique  steady  state  solution  to  3D  Navier-Stokes  with  random  forcing 
(which  is  in  fact  a  reasonable  assumption,  see  [1]).  While  Onsager’s  conjecture  is  somewhat 
speculative  for  the  3D  or  2D  Navier-Stokes  and  Euler  equations,  it  is  realizable  and  easily 
illustrated  within  the  framework  of  Burgers’  equation  with  forcing  and  periodic  boundary 
conditions: 


1  7T 

Ut  +  ^(v2)x  =  vuXx  -  -  sin(27nr),  x  e  [0, 1].  (23) 

For  v  >  0,  (23)  admits  smooth  solutions  with  “shock  layers”  of  size  0(z/);  however,  if 
v  — *  0,  solutions  develop  discontinuities  which  allow  for  anomalous  dissipation  of  energy. 
In  Fourier  space,  the  solution  of  (23)  with  v  —  0  is 

u(x,t)  =  ^  bn(t)  sin(2?t7nr)  (24) 

n€Z 

where  bn(t )  =  Im  un(t)  =  2  u(x,  t)  sm(2nnx)dx  and  satisfies 


bn  =  2  /  utsm(2ri7rx)dx 
Jo 

=  2  /  [— uux  +  f(x)]sm(2n7Tx)dx 

Jo 

~  ~ 2t17T  ^  bmbm-n  ~  ~  (!??,=]  2-n=  —  1 )  • 


(25) 


By  Parseval’s  identity  and  the  previous  display,  the  unique  steady  state  solution  must  satisfy 
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Figure  2:  Dissipative  solution  to  forced  inviscid  Burgers’  equation. 


E(t)  =  ^  X  \hn  =  “  X  2nnbnbmbn-m  +  7T&1  =  0. 


(26) 


n,m€Z 


If  the  sum  in  the  previous  display  is  formally  reorganized, 

^  ^  2mxbn bmbn—m  ^  ^  ^  2 ?z Tib —nbmb  -~n—rn 
n,m&  n,m& 

=  -  X  2nirbnbmbp 

n-\-mJrp= 0 

=  “  X  2(n  +  m  +  p)iTbnbmbp 


(27) 


n+m+p=0 


-0. 


We  are  then  led  to  believe  that  the  system  has  no  steady  state  solution  (with  bi  ^  0).  Yet, 
it  is  simple  to  show  that 


w(x)  =  —  COS7TX  (28) 

is  a  solution  of  (23)  in  the  inviscid  limit!  How  can  this  be? 

The  answer  lies  in  the  fact  that  w  is  a  weak  solution  and  rearrangement  of  the  sum  is 
invalid  because  the  coefficients  bn  do  not  decay  fast  enough.  In  order  to  compensate  for  the 
lack  of  a  viscous  dissipation  mechanism,  w  has  lost  regularity  and  developed  shocks  (see 
Figure  2).  One  has  that 


8n 


bn  —  ’/ a  o 

7r(4  nz 


1) 


Tin 


as  |n| 


oo 


(29) 


so  b^  ~  0(\n\  2).  In  this  case,  the  energy  of  the  steady  state  E  =  ]Cnez5^n  <  00  an<^ 
dissipation  is  nonzero: 
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(30) 


^2  ^Kbmbn-m  =  7r&i  =  --  <  0. 

n,m62 

The  above  example  illustrates  that  there  exist  steady  state  solutions  with  finite  energy 
that  dissipate  through  shocks.  In  the  next  section,  we  develop  yet  simpler  models  that 
exhibit  an  energy  cascade  and  anomalous  dissipation. 


2  Simple  Models 

Consider  the  following  infinite  dimensional  dynamical  system: 

(an  =  or{(n  -  l)pan-i  “  npan+ 1}  +  f(t) ln=i,  n  €  N 
\a0  =  0 

where  a  €  R,  p  =  0, 1,2  and  f(t) ln=i  is  a  time-dependent  forcing  term  on  the  first  mode. 
The  system  (31)  describes  a  linear  shell  model  with  nearest-neighbor  coupling  and  (as  we 
shall  see)  the  feature  that  it  allows  for  anomalous  dissipation.  We  will  focus  here  on  the 
case  p  =  1  with  forcing  f(t)  =  \/2eW(t)  and  will  speculate  on  cases  with  p  /  1.  In  the 
case  of  white  noise  forcing — which  has  the  advantage  of  being  uncorrelated  with  the  modes 
a,n — an  energy  balance  relation  analogous  to  (5)  can  be  derived: 

E(t)  =  ^  y~]  ^E a2n  =  a  ^  E{(n  -  l)panan_i  -  npa„an+1}  +  e  (32) 

neN  n€N 

and  we  have  that  anomalous  dissipation  is  possible  if 

-a  y  E{ (??  -  l)po.nqn-i  -  r)pana,n+i }  =  a.NpEaNa.N+i  — *  £■  (33) 

Z '  N—*oo 

n<N 

This  requires  that  for  steady  state  solutions 

an  ~  n~p /2  as  n  — >  oo  (34) 

since  if  an  scales  with  any  other  exponent,  solutions  will  have  either  zero  or  infinite  dis¬ 
sipation.  This  is  consistent  with  Onsager’s  conjecture  since  steady  states  of  the  model 
correspond  to  the  most  rapidly  decaying  {an}  which  allow  for  anomalous  dissipation!  By  a 
simple  scaling  argument,  one  has  that  steady  state  solutions  must  lie  on  the  boundary  of 
the  weighted  Z2  spaces 


h,P  =  1  {an}neN  ■'^2nP  <  00  >  ,  (35) 

l  n€N  J 

with  no  dissipation  in  the  interior  of  l^v  and  infinite  dissipation  in  the  exterior  of  I2#  (see 
Figure  3).  The  representation  formula  for  the  unique  steady  state  with  p  =  1,  to  be  derived 
in  a  subsequent  section,  agrees  with  this  picture. 
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finite  dissipation  for  p  =  2 


finite  dissipation  for  p  =  1 


Figure  3:  Spaces  I2#  and  corresponding  rates  of  dissipation. 


2.1  The  case  p  =  1 

We  now  show  how  to  solve  the  simple  model  in  the  case  p  —  1  with  forcing  f(t )  =  V2eW(t). 
Some  remarks  on  the  case  p  =  2  wil  be  made  in  Section  3. 

Consider  the  set  of  Laguerre  polynomials  Ln(x )  which  satisfy 


(i)  L0{x)  =  1 

(ii)  (n  +  l)Ln+i(x)  =  (2n  +  1  -  x)Ln(x )  -  nLn_ i(a;),  n  e  N 

(iii)  xL'n(x)  =  nLn(x)  -  nLn-i{x) 

poo 

(iv)  I  Lm(x)Ln(x)e  dx  =  6nm 
Jo 

Let  /o  =  0 ,fn(z)  =  Ln-i(z )  for  n  E  N,  and  define 


Then  one  has  that 


9{z,t)  =  ^2<bi(t)fn{z). 

ne  N 


(37) 


£(*.*)  =  £  *.(*)/»(*) 
neN 

=  ^  a{(n  ~~  l)on-i  -  na„+i}/n(z)  +  y/2eW(t) 
ne  N 

=  ^2  aan{2zfn(z)  +  (1  -  z)fn(z)}  +  V2eW(t) 

ne  N 

=  2 az^-(z,t)  +  a(l  -  z)g(z,t)  +  V2 eW(t) 
oz 


(38) 
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by  shifting  indices  and  by  properties  (36)  of  the  Laguerre  polynomials.  Solving  the  stochastic 
partial  differential  equation  in  the  previous  display,  one  can  use  the  orthonormality  of 
{fn(z)}ne N  to  find  an: 


Furthermore, 


an(t)  = 


9(z>t)fn{z)e  Zdz. 


(39) 


1  f°°  1 

E(t)  =  J2~  Ean=  /  o  ^92{z,t)e 

n€  N2  y°  2 


(40) 


E(t)  =  Y  Eana»  -  /  E g(z,t)g{z,t)e  Zdz 

n€N  J° 

where  equality  follows  from  Parseval’s  identity.  Note  that  from  (38)  one  can  also  derive  the 
conservation  form 


=  99e 


9 


dg 


2ct2  — — b  a(l  —  z)g  +  \/2eW 

OZ 


r\ 

=  —  (azg2e~z)  +  \f2eWge 
uz 


(41) 


We  are  now  equipped  with  the  tools  necessary  to  determine  properties  of  explicitly  deter¬ 
mined  solutions. 


2.2  Representation  Formulas  for  the  Steady  State  and  IVP 

We  now  derive  the  steady  state  solution  of  (31)  for  p  =  1  with  white-noise  forcing  and 
examine  the  initial  value  problem  (IVP)  without  forcing.  To  simplify  discussion,  take  a  =  1. 
Solving  (38)  with  initial  conditions  go(z)  =  g{z,0)  =  an{fyfn(z),  one  has  that 


=  e^2+t^ 


e1 ^  2e  *) go{ze2i )  +  \/2e  j  2e  ^dW(s) 


The  explicit  representation  for  the  unique  statistical  steady  state  solution  is  then 

an(t)  =  V Ye  f  dW(s )  r  dzLn-^eV-s- f 
J — oo  Jo 

so  an(t)  is  a  Gaussian  field  with  mean  0  and  covariance 


(42) 


(43) 


OO  f'OC  roc 


*l+*2 

E(an(t.)am (/.))  =  2e  I  ds  I  dzy  I  d.z2Ln-i(zi)Lm-i{z2)e\-  2 
./ o  Jo  Jo 

2  e 


(e2s+l)] 


(44) 


n  +  771—1 
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In  particular, 


—  lim  7  {(n  —  l)E(anan-i)  —  nE(anari+i)}  =  lim  NEa^aN^i  =  e 
oo  N—>oc 

n<N 

and  we  have  anomalous  dissipation. 

A  simple  consequence  of  the  conservation  form  (41)  is  that  energy  is  conserved  if  one 
begins  with  finite  energy: 

y^Ea^(O)  <  oo  implies  y^Ea2(t)  =  Ea2  (0)  for  t  >  0.  (45) 

neN  nGN  n€N 

There  also  exist  dissipative  solutions  with  )T]neNEa2(0)  =  oo;  that  is,  solutions  such  that 

E|an|  >  Cn-1/2  as  n  oo.  (46) 

Finally,  as  we  have  shown  above,  with  forcing  there  exists  a  unique  statistical  steady  state 
with  equilibrium  distribution  supported  on  the  the  most  regular  dissipative  solutions,  i.e., 
such  that 


Ea2  =  0(n  *)  as  n  — »  oo. 


3  PDE  Approximation  to  Simple  Models 

When  scaled  properly,  the  system  (31)  very  closely  resembles  a  finite  difference  scheme  for 
the  wave  equation  ut  =  cux.  Keeping  this  in  mind,  for  a  >  0  one  can  find  PDEs  whose 
solutions  mimic  those  of  the  simple  model  for  the  IVP. 

We  begin  by  setting  a  =  1  and  by  recalling  that  with  no  forcing, 


ftn  —  (ft  ly^n—l  ft^ftn+1* 

Let  an(t )  =  A(nh ,  th~p+l)  and  send  h  — >  0  with  nh  — >  x  and  th~pJrl  —>  r  to  obtain 

d A  „_■«  .  ndA  . 

A~2xaF-  (47) 

The  previous  equation  can  also  be  written  in  the  conservation  form 


Thus,  one  has 


Id  A2 
2~dr 


~ I <*"’>• 


(48) 


fL  A2(x,r)dx  =  -LpA2(L,t)  -f*  0  if  A2(L,  r)  >  CL~P 
d?  Jo  L— >oo 


1  d 

2 


(49) 


which  is  exactly  the  phenomenon  of  anomalous  dissipation! 
Now  consider  the  characteristic  equation  and  solution  of  (47): 


—  =  -2XP 
dr 


X(0) 


x 


(50) 
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(51) 


A(x,t)  =  Aq(X(t))  exp  (~P  J  Xp  1(Tl)dr^j. 

3.1  The  case  p  —  1 

If  p  =  1,  then  X(r)  =  xe~2r  and  there  is  no  anomalous  dissipation  if  J0°°  Aq(x)cIx  <  oo 
since  energy  is  conserved: 


roo  r  OO  j'OO 

A2(x,r)dx  =  I  A2(xe~2r)e~2rd.x  =  /  ^0(77)^77. 

Jo 


(52) 

/O  Jo  JO 

Anomalous  dissipation  only  occurs  in  this  case  if  Aq(x)  >  Cx~l  as  x  00,  and  in  particular, 
A2{x,r)  =  Cx"1  if  Jlg(x)  =  CaT1. 


3.2  The  case  p  =  2 

If  p  =  2,  then  AT(r)  =  and  anomalous  dissipation  occurs  even  if  f£°  A^(x)dx  <  00 

since 


r  00  /“oo 

|2. 


/t2 


/o  A{x'T)dx~J0  “u\\-\-2tx)  (l  +  2a:r)2 

/•l/2r 


dx 


Aoiv)dv 


/  0 


<  /  Ao{ii)dij. 

Jo 


Notice  also  that 

^2(a;,r)  =  ^ 


1 


0  '  l  +  2rx  7(1  +  2xt)2 
so  that  one  has  anomalous  dissipation  for  r  >  r*,  where 

12 


r*  =  min{r  :  A(,(l/2r)  7^  0}. 


(53) 


Ao(1/2t)(2o:t)  2  as  x-^00 


(54) 


(55) 


3.3  Properties  of  the  Solutions 

It  can  easily  be  seen  that  the  approximating  PDE  is  consistent  with  the  simple  model 
for  p  =  1  since  the  aforementioned  properties  exactly  mirror  those  given  in  the  previous 
section.  We  can  speculate  that  this  is  true  for  all  other  values  of  p  as  well.  Moreover,  since 
the  behavior  of  the  characteristics  completely  determine  the  properties  of  the  IVP  solution, 
by  (50)  one  has  that  solutions  with  p  <  1  behave  like  the  p  ~  1  solution,  and  solutions  with 
p  >  2  behave  like  the  p  =  2  solution  with  regards  to  anomalous  dissipation. 

There  is  an  interesting  analog  between  solutions  of  the  approximating  PDE  for  p  =  2 
and  p  =  1  and  those  of  the  3D  and  2D  Euler  equations,  respectively.  As  discussed  in  [3], 
solutions  of  3D  Euler  with  finite  energy  are  expected  to  dissipate,  as  in  the  case  p  —  2. 
In  contrast,  it  has  been  proved  [4]  that  solutions  to  2D  Euler  (in  which  the  enstrophy 
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Figure  4:  Characteristics  of  the  approximating  PDE  for  (a)  p  =  1  and  (b)  p  =  2. 

5  =  ^|u>|2  =  j|V  x  u|2,  rather  than  the  energy,  cascades  to  small  scales)  conserve  enstrophy 
whenever  the  enstrophy  itself  is  finite,  just  as  in  the  case  p  =  1!  Furthermore,  if  we  define  the 
“turnover  time”  r(fc)  in  the  context  of  3D  Euler  as  the  only  time  scale  which  can  be  derived 
from  the  wavenumber  magnitude  k  =  |k|  and  energy  flux  e  (equivalently,  the  enstrophy  flux 
e  in  2D  Euler),  dimensional  analysis  gives  that 

r(k)  =  Ce~1/3k- 2/3  (3D  Euler),  f(fc)  =  Ci~2/3  (2D  Euler).  (56) 

Since  the  turnover  time  describes  the  time  required  for  energy  (enstrophy)  to  pass  through 
wavenumbers  of  magnitude  k  and  fi(k)  =  ln(fc)  is  the  natural  scale-invariant  measure  as¬ 
sociated  with  /c,  the  total  time  for  energy  (enstrophy)  to  move  from  k  —  l/Lf  to  k  =  oo 
is 

roc  roc 

/  r{k)dp{k)  <  oc  (3D  Euler),  /  r(k)dfi(k)  =  oc  (2D  Euler).  (57) 
Jl/Lf  Jl/Lf 

Analogously,  characteristic  lines  in  the  case  p  =  2  go  to  infinity  in  finite  time,  while  those 
for  p  =  1  go  to  infinity  in  infinite  time  (see  Figure  4)! 

It  is  worth  noting  that  in  the  derivation  of  the  PDE  (47)  we  have  made  some  smoothness 
assumptions  on  solutions  of  the  IVP  in  the  simple  model.  These  assumptions  can  be  justified 
if  a  >  0  but  are  quite  wrong  if  a  <  0.  To  see  this,  let  a  =  —  1  and  consider  the  IVP 
an(0)  —  tn=i  for  p  =  0.  Then  for  0  <  t  <C  1  one  has  that  an  >>  am  for  n  <  m,  and  so 

an{t)  ~  —an_i(i)  for  n  £  N,  with  a>o  =  0. 

This  implies  that  for  small  values  of  £, 

an{t)  ~  (-l)”-1^. 

The  solution  of  the  IVP  is  thus  initially  highly  oscillatory  and  cannot  be  approximated  by 
a  smooth  function  in  any  strong  sense. 
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Vibrating  pendulum  and  stratified  fluids 

Inga  Koszalka 


1  Abstract 

The  problem  posed  is  the  stabilization  of  the  inverted  state  of  a  simple  pendulum  induced 
by  high-frequency  vertical  oscillations  of  the  pivot  point.  The  stability  conditions  are  de¬ 
rived  by  means  of  the  multiscale  perturbation  leading  to  the  averaged  dynamics  as  well  as 
by  linearization.  Then  the  concept  and  methods  are  applied  to  the  study  of  an  incompress¬ 
ible,  inviscid,  stratified  fluid  under  the  Boussinesq  approximation.  The  mechanism  of  the 
stabilization  of  the  fluid  system  was  found  to  be  analogous  to  that  of  pendulum  provided 
that  the  density  disturbance  has  the  form  of  a  wave  or  the  sum  of  waves.  However,  the 
analogy  in  case  of  a  general  density  disturbance  is  not  obvious. 

2  Introduction 

A  simple  pendulum  has  only  one  stable  state,  the  straight-down  position.  However,  if 
its  support  vibrates  in  the  vertical  or,  equivalently,  when  the  gravity  is  modulated  at  a 
frequency  much  greater  than  the  natural  frequency  of  the  pendulum,  then  it  is  also  possible 
for  the  inverted  (upside-down)  position  to  be  a  stable  state.  The  problem  dates  back  to 
1908  when  Stephenson  showed  that  it  is  indeed  possible  to  stabilize  an  inverted  pendulum 
by  subjecting  the  pivot  to  small  vertical  oscillations  of  suitably  high  frequency  ([17],  [18], 
[19]).  However,  it  was  the  work  of  Piotr  Kapitza  ([10])  that  drew  broader  attention  and 
commenced  a  series  of  studies  concerned  with  this  interesting  phenomenon,  called  sometimes 
for  that  reason  ’’Kapitza  pendulum”.  Similar  behavior  of  parametrically  forced  systems  in 
this  parameter  regime  was  found  in  other  problems,  like  particle  trapping  and  even  evolution 
of  market  prices  (e.g  see  [6],  [7]). 

The  purpose  of  this  work  is  to  investigate  the  stabilization  of  the  inverted  pendulum  and 
to  apply  the  concept  and  the  methods  developed  to  fluid  dynamics.  The  pendulum  system 
is  treated  by  means  of  the  multiscale  perturbation  which  leads  to  the  averaged  dynamics, 
as  well  as  by  linearization,  which  reduces  the  problem  to  Mathieu  equation.  As  a  simple 
fluid  analog  we  choose  incompressible,  stratified  fluid  under  the  Boussinesq  approximation 
in  periodic  domain,  subjected  to  a  rapidly  varying  gravitational  field.  We  focus  on  the 
multiscale  technique  and  averaged  dynamics  to  find  the  stabilization  mechanism  equivalent 
to  that  obtained  for  the  Kapitza  pendulum. 
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Figure  1:  Kapitza  pendulum. 

3  Vibrating  inverted  pendulum 

3.1  Problem  formulation.  Equation  of  motion 

We  consider  a  simple,  nonlinear  pendulum  of  mass  m  and  length  Z,  moving  on  a  vertical 
plane  in  the  uniform  gravitational  field  and  subjected  to  a  vertical,  rapid  vibration  of  the 
pivot  point.  By  rapid  vibration  we  mean  oscillation  of  high  frequency  and  small  amplitude 
of  the  pivot  motion,  given  a  form: 


CP)  =  acosfrt).  (1) 

The  parameters  of  the  external  forcing,  the  amplitude  of  the  vertical  motion  and  the  fre¬ 
quency,  obey 

a  ~  O(e),  7  ~0(-). 

€ 

where  €  is  a  small  number.  Following  the  classical  work  of  [11],  we  choose  the  coordinate 
system  depicted  in  figure  1  and  the  following  transformation: 

x  =  Z  sin  (p 

y  =  l  cos  <j)  +  a  cos (7 1),  (2) 

where  4>  €  R/27tZ  is  the  angle  that  the  pendulum  forms  with  the  downward  vertical.  From 
the  Lagrangian  of  the  system  (for  the  derivation,  see  Appendix  A),  we  obtain  the  equation 
of  motion  of  the  vibrationally  forced  pendulum: 

ml2(p  +  mgQl  sin  <f>  +  ma/72  cos (7/)  sin  </>  =  0,  (3) 

where  g0  is  the  gravitational  constant.  It  is  worth  noting  that  by  a  simple  rearrengement 
leading  to: 

l(f>  +  ( 9o  -  072  cos(7 1))  sin  (p  =  0, 

we  can  regard  the  system  of  the  pendulum  with  the  vertically  oscillating  support  as  a 
immotilc  pendulum  in  the  field  of  a  modulated,  rapidly  varying  modified  gravity  of  the 
form : 

1  A 

9  =  9o  +  -g- 1(-). 
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Defing  the  natural  frequency  u>0  of  the  pendulum  by  w2  =  ^r,  we  get  a  more  concise  form 
of  the  equation  of  motion: 


<j)  +  (w2  +  y72  cos(q t))  sin  0  =  0. 


(4) 


It  is  more  convenient  to  operate  with  non-dimensional  parameters.  Without  the  loss  of 
generality,  we  let  u;2  =  1  and  divide  (4)  by  it.  The  nondimensional  time  is  set  to  be  t*  =  u)0t. 
We  define  the  ratio  of  forcing  and  natural  frequencies  Q,  and  the  relative  amplitude  of  the 
forcing  / 3  as,  respectively: 


n-^-7, 

LOn 


o-v 


where  Q  ~  0{\)  and  /3  ~  0(e).  Dropping  asterics,  the  equation  of  motion  of  the  so-called 
’’normalized  pendulum”  becomes: 


<j>  +  (1  +  (3Q 2  cos(Q  t))  sin  0  —  0. 


(5) 


This  is  a  nonlinear  equation  with  periodic  coefficients,  and  is  nonintegrable.  Note  that  it 
describes  a  forced  motion  in  a  uniform  field  of  gravitational  potential  U0 : 

=  —  _  /?fi2  cos(fH)  sin  0,  where  [/O  =  ~cos0  (6) 

ot2  o(p 

The  stability  properties  of  this  system  have  been  studied  either  by  means  of  averaging  (e.g. 
[10],  [11]),  i.e.  effective  potential  method,  or  by  linearization  around  fixed  points  which 
leads  to  Mathieu  equation  (e.  g.  [18],  [8]).  The  information  obtained  by  neither  of  the 
two  approaches  is  complete,  however  it  is  in  some  sense  complementary;  therefore  we  found 
worthwile  to  apply  both  of  them.  We  obtain  the  averaged  dynamics  by  an  alternative  tech¬ 
nique,  the  multiscale  perturbation,  and  then  zoom  into  the  neighborhood  of  the  equilibra 
by  means  of  linearization. 


3.2  ^  problem  and  multiscale  perturbation 

We  note  that  there  are  two  well  separated  time  scales  in  our  problem,  corresponding  to 
the  slow  motion  of  the  pendulum  and  fast  oscillation  of  the  pivot  point.  We  can  therefore 
attempt  to  find  an  asymptotic  solution  valid  on  long  time  scales  of  the  pendulum  motion. 
We  will  define  a  perturbation  parameter  and  its  relation  with  the  forcing  parameters  by: 

c=^,  |e|  «  1,  0=j  =  e0.  (7) 

The  independent  time  scales  in  our  problem  are: 

slow  time:  t  ~  0(1) 

fast  time:  r  =  r  ~  O(-), 

and  so  the  equation  of  motion  (4)  can  be  expressed  as: 


The  first  and  the  second  time  derivatives  become,  respectively: 

d,  d  Id  d2  _  d2  2d 2  Id2 

dt  dt  +  e  dr'  dt 2  dt2  e  drdt  e2  dr2 ' 

Expanding  the  variable  <f>  in  power  series  of  e  yields: 

<p(t,  t)  =  <f>0(t,  t)  +  e<j) i  (t,  r)  +  e2<p2(t,  r)  +  . . .  .  (9) 


Inserting  the  perturbation  series  (9)  into  (8)  and  assembling  powers  of  e  yields  a  set  of 
equations  for  the  subsequent  orders  in  e.  Manipulation  of  these  equations  results  in  a  group 
of  terms  that  give  unbounded,  linear  growth  of  the  solution  in  fast  time  r  which  obviously 
destroys  the  solution  on  long  time  scales.  Such  terms  are  called  secular  terms  and  a  standard 
procedure  in  multiscale  perturbation  technique  is  to  remove  them  by  making  them  equal  to 
zero  and  vanish  ([9]).  For  the  justification  and  more  detailed  treatment  of  the  problem,  see 
Appendix  B.  The  condition  for  the  solution  to  be  valid  uniformly  on  t  gives  the  equation  of 
motion  of  the  leading  order  quantity: 


d2<fto 

dt2 


=  -(1  + 


mf 

2 


cos  (fi0)  sin  (p0. 


(10) 


It  is  worth  to  note  that  the  same  result  would  be  obtained  if  the  average  of  the  0(1) 
equation  over  the  fast  time  has  been  taken,  defined  as  following: 


r+T 


e«T«  1, 


V»(<)  =  f 


1p(t,T')d,T' , 


(11) 


(see  Appendix  B).  Using  the  fact  that  cos 2r  =  i,  we  get: 


d24> 

w 


-(1  + 


ml 

2 


cos  <f>)  sin  4>. 


(12) 


Therefore,  we  can  consider  (10),  governing  the  dynamics  of  the  0(1)  quantity,  to  be  equiv¬ 
alent  to  the  dynamics  of  a  variable  averaged  over  the  fast  oscillations  (12). 


3.3  Averaged  dynamics.  Effective  potential. 


We  can  note  the  averaged  dynamics  of  the  vibrationally  forced  pendulum  governed  by  (12), 
can  be  perceived  as  a  motion  in  the  field  of  effective  potential  U. : 


<920  __  3U 

W  = 


where 


U  —  —  cos  <f>  + 


P2Q2 

4 


sin 


,2  T 


(13) 


Interestingly,  there  is  no  explicit  time  dependence  in  the  governing  equation,  as  the  effective 
potential  is  dependent  only  on  the  mean  state  of  the  system  and  parameters  of  the  forcing 
(compare  with  (6)).  This  observation  allows  for  the  following  physical  explanation  of  the 
process,  given  by  [10].  Vibrational  gravitational  field  (induced  by  the  oscillatory  motion  of 
the  pivot)  leads  to  the  production  the  vibrational  torgue  which  on  average  manifests  as  an 
ordinary  force.  This  force  tends  to  set  the  rod  of  the  pendulum  in  the  direction  of  the  axis  of 
the  oscillations.  Provided  the  vibrational  force  is  balanced  by  the  gravity,  the  inverted  state 
exhibits  the  dynamical  equilibrum  and  becomes  stable.  We  can  now  express  the  stability 
condition  for  the  upper  equilibrum  in  terms  of  the  parameters  of  the  forced  system.  Let’s 
take  a  closer  look  into  the  stability  of  the  system. 
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unforced  pendulum 


vertically  forced  pendulum 


Figure  2:  Left:  Potential  as  a  function  of  <j>  for  the  unforced,  nonlinear  pendulum.  Right: 
effective  potential  for  vibrationally  forced  pendulum.  Note  that  scales  of  the  two  figures  are 
set  different  show  the  shape  of  the  functions. 

3.4  Stability  of  equilibria  of  averaged  dynamics 

The  stability  of  equilibra  can  be  most  easily  determined  from  (13).  By  setting  the  right 
hand  side  to  zero,  we  find  positions  of  equilbra,  i.e  the  extrema  of  U.  There  are  four  such 
points,  in  contrary  to  the  case  of  the  unforced  nonlinear  pendulum,  when  there  are  only  two 
(fig.  2).  The  nature  of  the  extremum  is  determined  by  the  sign  of  the  second  derivative  of 
U)  <  0  indicates  maximum  (i.e.  an  unstable  equilibrium),  >  0,  minimum  (a  stable 
equilibrum  called  a  potential  well): 

Equilibrum  Stability 

</>*  —  0  (noninverted)  stable 

<f> 2  =  arccos(— )  unstable 

</>3  =  arccos(— —  ^-— )  unstable 

(j)\  —  7r  (inverted)  conditionally  stable 


209 


Figure  3:  Time  series  and  phase  plots  of  the  stabilized  inverted  pendulum.  Blue  color 
indicates  the  nonaveragcd  and  red  -  averaged  dynamics.  Left:  stable  configuration  = 
11,  /?  =  0.2.  Right:  unstable  configuration  (rotational  mode)  Q  =  11,  (3  =  0.1.  Note  that 
the  scales  of  the  phase  space  plots  are  set  different  to  show  the  shape  of  trajectories. 


Figure  4:  Time  trace  of  the  vibrationally  forced  pendulum  in  physical  space.  Left:  stable 
configuration  Q  —  11,  [3  =  0.2.  Right:  unstable  configuration  fi  =  11,  /?  —  0.1. 


Prom  (13),  the  condition  for  the  stable  upper  equalibrum  (f> \  is: 


/ 32n 2 
2 


>  1. 


(14) 


The  stability  of  the  inverted  position  depends  solely  on  the  parameters  of  the  system  and 

requires  a  forcing  of  suitably  high  frequency.  Note  also  that  an  angle  <^3  “  arccos(~  . ) 

can  be  interpreted  as  a  width  of  the  potential  well,  namely  maximum  initial  displacement 
that  allows  for  the  stabilization  of  the  upper  equilibrum  for  given  properties  of  the  forcing. 
Exemplary  time  series  and  phaseplots  of  the  pendulum  in  a  stable  and  unstable  regime 
is  presented  on  figure  3,  for  both  nonaveraged  and  averaged  dynamics.  Corresponding 
behavior  of  the  pendulum  in  the  physical  space  is  shown  in  figure  4. 

There  is  another  point  of  view  of  the  averaged  dynamics.  The  problem  of  the  invert¬ 
ible  pendulum  may  be  considered  in  the  1  ^-degree-of-freedom  Hamiltonian  setting.  The 
nonlinear  dynamics  is  then  described  by  the  Poicare  map  or  equivalently  by  its  integrable 
approximation,  a  planar  Hamiltonian,  obtained  from  the  normal  form  theory:  successive 
transformations  leading  to  the  removal  of  the  explicit  time  dependence.  As  the  effective 
potential  corresponds  to  the  potential  energy,  the  planar  Hamiltonian  is  equal  to  the  total 
energy  of  the  averaged  system.  In  this  framework,  by  defining  a  parameter  A  =  the 

transition  of  the  inverted  equilibrum  from  a  minimum  for  A  <  1  to  a  saddle  point  for  A  >  1 
may  be  considered  a  subcritical  Hamiltonian  pitchfork  bifurcation  ([5]),  see  figure  5. 


Figure  5:  Bifurcation  diagram  in  the  (A,</>*)  -  plane  for  the  inverted  pendulum,  where 
A  =  and  (ft*  is  a  fixed  point.  Green  dots  indicate  instability,  magenta  dots  -  stability. 

3.5  Mathieu  equation 

Another  approach  to  the  study  of  the  inverted  pendulum  is  the  linearization  of  the  dynamics 
near  an  equilibrum.  We  define  a  =  ^  and  look  again  into  the  dynamics  of  (5),  but  now  as 
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evolving  in  the  fast  time  r  =  7^: 

<t>TT  +  (a  +  P  cos  t)  sin  <j>  =  0.  (15) 

Now  we  zoom  into  the  dynamics  near  7r,  so  it  is  convenient  to  define  the  complementary 
angular  displacement  tp  with  respect  to  <£,  p  =  n  —  (j).  This  is  the  angular  displacement 
from  the  upper  equilibrum  position.  Then  sin  <p  —  sin(7r  —  </>)  =  +sin  (j)  and  <pTT  =  — </>TT. 
The  governing  equation  becomes: 


<pTT  —  (a  +  j3  cos  t)  sin  p  =  0. 


(16) 


The  sign  of  (3  is  not  important  as  it  corresponds  to  the  instantenous  amplitude  of  the  pivot 
motion  around  the  center  of  the  coordinate  system,  which  can  be  postive  or  negative.  The 
sign  of  O',  however,  matters.  We  define  the  variable  &s 


<ip  = 


angular  displacement  near  the  lower  equilibrum 
angular  displacement  near  the  upper  equilibrum 


and  linearize  sin  xj)  ^  obtaining  the  canonical  form  of  Mathieu  equation: 


d 2 'll) 
dr 2 


+  (a  +  /?cos  r)rj)  =  0. 


(17) 


Thus  equation  (17)  describes  linearized  dynamics  around  the  either  fixed  point,  depending 
on  the  sign  of  g: 


a  >  0  :  lower  equilibrum  q  <  0  :  upper  equilibrum  (18) 

This  specific  formulation  allows  the  investigation  of  the  linear  stability  near  the  either 
equilibrum  based  on  the  general  results  for  the  Mathieu  equation.  It  is  also  a  manifestation 
of  the  fact  that  we  can  tackle  the  stability  of  the  inverted  state  by  changing  the  sign  of 
the  gravity  g.  The  stability  of  the  periodic  solutions  of  the  Mathieu  equation,  given  by  the 

Floquet  theory,  can  be  determined  from  the  diagram  in  (/3,  a)  -  parameter  space  ([9]).  The 
so-called  transition  curves  separate  regions  of  values  a  and  /?  corresponding  to  unstable  and 
stable  solutions.  Kapitza  pendulum  regime,  with  high  forcing  frequency  (small  a)  and  small 
amplitude  of  the  pivot  motion  (small  /?)  lies  in  the  region  marked  by  the  red  circle  in  fig  (6). 
The  most  surprising  is  the  fact  that,  contrary  to  the  averaged  dynamics,  linearized  approach 
gives  the  possibility  for  destabilization  of  the  lower  equilibrum!  We  can  obtain  quite  exact 
values  for  the  parameter  regions  corresponding  to  stable  inverted  and  noninverted  equilibra: 


Equilibrum 

Stability  condition 

inverted 

$  </?<0.45+^ 

noninverted 

O<0<  0.45 
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It  is  worth  noting,  that  the  linear  stability  condition  for  inverted  pendulum  based  on  lin¬ 
earization  was  generalized  to  an  inverted  JV-pendulum  and  proven  by  [1].  In  that  case  the 
system  can  be  reduces  to  N  uncoupled  Mathieu  equations  and  the  stabilty  condition  yields 


\/2 g  ^  0.450g 

^  P  ^  o  > 

If^min  & max 


(19) 


where  Wmin  and  u?max  are  the  lowest  and  the  highest  of  the  natural  frequencies  of  any  single 
pendulum  member  of  the  full  configuration. 


Figure  6:  Left:  Stability  diagram  for  Mathieu  equation.  Right:  the  Kapitza  pendulum 
regime. 


3.6  Averaged  dynamics  vs  Mathieu  equation 

Both  approaches  to  the  study  of  the  dynamics  of  the  pendulum  system  under  the  influence 
of  a  rapidly  oscillating  forcing  are  not  completely  adequate,  i.e  the  relevance  of  either  of 
them  is  limited.  Here  we  summarize  the  key  points  concerning  the  applicability  of  the  two 
methods. 

Averaged  dynamics: 

•  approximates  long  time  asymptotic  behavior, 

•  concerns  global  dynamics,  i.e  arbitrary  displacements  from  the  equilibrum  position. 
Hence,  apart  from  the  very  proximity  of  the  inverted  state,  allows  the  study  of  the 
rotating  state  of  the  pendulum, 

•  gives  only  lower  stability  bound  in  the  parameter  space  for  the  inverted  equilibrium, 

•  gives  the  maximum  angular  displacement  that  permit  stabilization  of  the  inverted 
state  for  a  given  set  of  parameter  values, 
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•  predicts  the  stability  of  the  lower  equilibrum  for  all  the  values  of  the  parameters, 

•  gives  physical  explanation  for  the  dynamical  stabilization  phenomenon  in  terms  of  the 
effective  potential. 

Certainly  some  subtle  details  in  the  pendulum  dynamics  are  lost  in  the  approximate  average 
analysis,  which  refers  only  to  the  slow  component  of  the  motion.  In  this  method  one 
introduces  an  approximation  with  no  control  on  the  relevance  of  the  discarded  dynamics, 
except  of  the  estimate  of  their  magnitude  in  terms  of  e.  There  is  always  a  region  in  parameter 
space  where  the  averaging  fails  predicting  unstable  configuration  -  the  region  over  the  upper 
bound  in  case  of  the  inverted  state  and  the  whole  unstable  region  for  the  lower  equilibrum, 
which  can  be  found  from  the  Mathieu  equation. 

Linearized  theory: 

•  reduces  the  problem  to  the  well-known  Mathieu  equation, 

•  applies  only  to  the  vicinity  of  the  fixed  points,  i.e  is  valid  only  for  small  angular 
displacements  from  the  equilibrum  position, 

•  gives  more  precisely  defined  stability  regions:  lower  and  upper  bounds  on  the  param¬ 
eter  values, 

•  admits  the  possibility  for  destabilization  of  the  lower  equilibrum  and  gives  the  range 
of  parameter  values  for  which  it  should  be  observed, 

•  does  not  provide  with  any  physical  explanation  of  the  phenomenon. 

In  general,  the  applicability  of  the  Mathieu  equation  is  limited  as  in  general  the  linear 
stability  cannot  be  extended  immediately  to  the  full  system.  However,  it  gives  correct 
results  for  the  zoomed  view  into  the  dynamics  near  the  equilibra. 

4  Kapitza  pendulum  in  fluid  systems 

4.1  Kapitza  pendulum  vs  fluid  systems 

As  already  mentioned,  we  can  regard  a  pendulum  with  a  vertically  oscillating  support  as 
equivalent  to  a  pendulum  with  a  stationary  support  in  a  periodically  varying  gravity  field. 
This  observation  allows  to  attempt  at  finding  an  analogy  between  the  Kapitza  pendulum 
and  fluid  systems  acted  upon  by  vibrating  gravitational  forces,  and  thus  study  the  dynamics 
of  the  latter  by  methods  developed  in  the  preceding  sections.  A  similar  analogy  was  already 
mentioned  by  Lord  Rayleigh  ([15]),  who  considered  a  phenomenon  observed  in  the  famous 
Faraday  experiment:  a  surface  wave  instability  in  a  vertically  vibrated  container  filled  with 
fluid,  with  the  frequency  close  to  resonance  with  the  natural  frequency  of  the  system. 
From  that  time  on  the  problem  has  been  studied  widely  both  analytically  (e.g.  [2],  [12], 
[13])  and  experimentally  (e.g.  [4]).  The  "Kapitza  regime”  discussed  here  is  different  as 
the  forcing  frequency  is  much  higher  then  the  natural  one  and  it  would  not  be  trivial  to 
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think  of  an  experiment  similar  to  that  of  Faraday.  One  can  rather  search  for  an  idealized 
physical  situation.  We  have  chosen  a  simple  fluid  system  described  by  Boussinesq  equations, 
characterized  by  density  stratification  and  related  buoyancy  force,  which  action,  combined 
with  that  of  gravity,  provides  the  mechanism  for  inertial  oscillations  with  bounded  natural 
frequency. 


4.2  Boussinesq  equations.  Problem  formulation 


We  will  study  the  dynamics  of  a  incompressible,  inviscid,  stratified,  hydrostatic,  non¬ 
rotating  fluid,  governed  by  nonlinear  Boussinesq  equations  (the  momentum  equations,  con¬ 
tinuity  and  conservation  of  density),  see  e.g.  [16]): 


Du  1  dp 
Dt  p  dx 


Dw  1  dp 
Dt  pdz 


9P 

P 

dw 


du 

dx  dz 
Dp 


Dt 


=  0 

(20) 

=  b 

(21) 

=  0 

(22) 

=  0 

(23) 

Here  t  is  time  and  u  and  w  are  the  components  of  the  nondivergent  velocity  field  in  horizon¬ 
tal  (x)  and  vertical  ( z )  directions  in  a  Cartesian  frame  of  reference.  The  positive  ^-direction 
is  antiparallel  to  gravity.  We  will  work  in  the  (x,  z)  -  plane,  which  implies  that  all  charac¬ 
teristics  of  the  system  are  uniform  in  y . 

We  consider  a  hydrostically  balanced  reference  state  upon  which  perturbations  are  to  be 
imposed  and  make  use  of  the  Boussinesq  approximation,  which  means  that  the  density  field, 
represented  by  p(x,  2,  t)  =  p  +  p0(z)  +  Ap(x,  z,  t)  =  p(l  —  <S(^))  +  A p(x,  2,  t),  satisfies  p  » 
Po(z)  >>  A p.  The  mean  fluid  density  p  is  a  uniform  costant.  The  density  stratification  is 
assumed  to  be  linear  and  S  is  a  stability  parameter  defined  as: 


{+1  stable  stratification 
—1  unstable  stratification 

The  vertical  extent  of  the  domain  is  small  compared  to  a  density  depth  scale  H  = 

Pressure  field  p  is  assumed  to  be  hydrostatically  related.  The  joint  effect  of  gravity  and 
density  stratification  leads  to  a  buoyancy  force  b  in  the  vertical.  The  natural  frequency  of 
the  system,  so-called  the  Brunt-Vaisala  frequency,  is  defined  through  N2(z)  =  —  (f  1*f)j 
which  is  constant  in  case  of  linear  stratification,  and  with  notation  used  here  TV  =  We 
will  constraint  us  to  the  flows  with  periodic  boundary  conditions. 

Nondivergence  of  the  velocity  field  allows  to  introduce  a  stream  function  such  that 
u  =  %j)z  and  w  =  In  an  infinite  medium  the  Boussinesq  equations  are  satisfied  by 

planar  internal  waves  of  the  form  ^  cos [kx  +  mz)  e~iu)t  (with  horizontal  and  vertical 
wave  numbers  k  and  m  and  k  =  y/k2  +  m2),  which  obey  the  dispersion  relation  u  =  ±7Vj^. 
In  polar  coordinates  chosen  so  that  k  =  k  cos  a  and  m  =  k  sin  a  we  have  to  =  ±7V  cos  a.  The 
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frequency  is  therefore  a  function  of  the  angle  that  the  wave  vector  makes  with  the  vertical. 
The  condition  for  progressive  waves  is  therefore  0  <  u)  <  TV  so  N  acts  as  the  upper  bound 
of  internal  wave  frequencies,  corresponding  to  entirely  vertical  flow  (buoyancy  oscillation) 

([3])- 

Now  we  will  embed  the  system  in  a  modified  gravity  field,  varying  in  vertical  as  g(t)  — 
9o  +  o,72cos(7 i),  where  gQ  is  a  gravitational  costant,  the  amplitude  of  oscillatory  motion 
satisfies  a  «  gQ ,  and  the  frequency  of  the  oscillations  7  is  much  higher  then  the  Brunt- 
Vaisala  frequency,  that  is  7  >>  N. 

We  will  nondimensionalize  the  system  as  follows: 


H),  t*  - 


which  results  in  the  nondimcnsional  components  of  the  velocity: 

u  w  v 


(«W) 


W9JI1  s/9oH 

The  nondirnensional  density  and  pressure  fields  are: 


A  p 


P 


p*(x,z,t)  =  (l +  Ap*(x,z,t),  where  Ap*  =  — ,  and  p*  =  _  , 

H  p  pg0H 

which  gives  the  nondimcnsional  Brunt-Vaisala  frequency  N*2  =  S.  By  introducing  the 
nondirnensional  parameters: 

O'  0/ 

« =  h 

9o 


the  ratio  of  the  forcing  and  natural  frequencies  and  the  relative  magnitude  of  the  forcing, 
respectively,  we  get 

g*(t)  =  £^1  =  1  +  /3Q,2  cos(Qt*).  (24) 

9o 

By  dropping  asterics,  the  nondirnensional  Boussinesq  equations  are: 


Du  dp 
~Dt  +  dr. 

=  0 

(25) 

Dw  dp 
~Dt  +dz 

=  —  A/?(l  +  /?fi2  cos(fi/)) 

(26) 

du  dw 

(27) 

dx  dz 

=  0 

dA  p 
dt 

dA  p 

,  dA  p 
+  U)  .  wo 

oz 

=  0 

(28) 

We  can  eliminate  the  pressure  by  focusing  on  the  vorticity  equation,  which 

Dq  D  ,du  dw  N  dA p  t 
Dt  =  ^  =  +  PSl  COS(m))- 


is: 


(29) 


It  is  evident  that  the  vibrating  gravity  field  modifies  the  basic  mechanism  of  the  baro- 
clinic  generation  of  vorticity.  We  will  now  investigate  this  phenomenon  in  detail,  using  the 
methods  derived  for  the  stabilized  inverted  pendulum. 


216 


4.3  Multiscale  expansion 

We  can  observe  that,  similarly  to  the  case  of  the  Kapitza  pendulum  discussed  above,  in  our 
Boussinesq  problem  we  have  two  well  separated  time  scales:  the  slow  gravity  oscillations 
and  the  fast  oscillation  of  the  gravity  field.  Thus,  an  attempt  to  find  an  asymptotic  solution 
valid  on  long  time  scales  is  justified.  Analogously  as  for  the  pendulum,  we  can  define  a 
perturbation  parameter  by  e  =  and  we  have  / 3  ~  0(e).  The  slow  time  is  then  t  ~  0(1), 
and  the  fast  time  r  =  fit,  r  ~  0(~),  compare  with  (7).  The  oscillating  gravity  field  (24) 
can  be  expressed  as  g(t)  =  1  +  Unlike  in  the  pendulum  case,  system  variables 

depend  now  not  only  on  time,  but  also  on  space,  therefore  we  have  for  the  first,  the  second 
and  material  time  derivatives,  respectively: 

8  d_  1  d_  cP_  2JP_  D_  _  d_  d_ 

dt  dt  ^  e  dr 5  dt 2  <9 t2  ^  e  drdt  e2  dr 2  ’  Dt  dt  +  dx  +  dz 

With  the  averaging  operator  defined  as  (11),  the  components  of  the  velocity  perturbation 
are  expanded  in  perturbation  series  as: 

u(x,z,t,r)  =  vZ(x,z,t)  +  u'0(x,z,t,r)  +  ...  Q 

w(x ,  z,  i,  r)  =  Zj  t )  +  w'0(x ,  z,  t:  r)  +  ....  , 

where  u U)  refer  to  the  mean  perturbation  velocity,  while  (u'0,  w'0)  correspond  to  the 

disturbance  due  to  modified  acceleration  of  the  order  0(-);  after  time  integration,  we  can 
expect  them  to  be  of  the  order  0(1),  but  as  they  came  from  the  oscillatory  motion,  they  will 
vanish  when  averaged  over  the  fast  time.  We  will  define  the  mean  substantial  derivative: 

D  d  _d  _ d 
Dt  ~  dt+U°dx+W°dz' 

The  density  perturbation  is  expanded  in  e  as: 


A p(x,  z ,  t ,  t)  =  A p0(x,  z,  f ,  r)  +  eApi  (x,  z,  t,  r)  +  ...  . 


(31) 


Inserting  the  series  (31)  and  (30)  into  the  density  equation  (28),  we  get: 


8Ap0  1  dAp0  dApi  dApj  _ dAp0  ,  dAp0  _ _ dApi  ,  dApi 


dt 


+ 


dr 


“h  € 


dt 


+ 


dr 


4-  uQ 


,  _ dApa  ,  8Ap0  _ dApi 

+  w0—7r~  +  w0— - h  w0— - h  w, 


dx 


dx 


dx 


dx 
,,  dA pi 
dx 


+  u\ 


0  dx 
w^S  -  w'S 


T  wp  ~ 


dx 


■+t£ 


dx 


:+ 


0, 


(32) 


Gathering  terms  of  the  order  0(-)  we  conclude  that  A p0  =  A p0(x,z>t).  In  the  order 
of  0(1),  we  will  first  average  out  the  terms  containing  perturbation  quantities  A p\  and 
( ufQ ,  w'0),  so  that  the  remaining  terms  give  an  evolution  equation  for  the  mean  density 
perurbation: 

£^--w-oS  =  0.  (33) 

Subtraction  of  (33)  from  (32)  results  in  an  evolution  equation  for  Ap^: 


dA  pi 
dr 


+  < 


d^Po 

dx 


+  < 


dApc 

dz 


-  w'0S  =  0. 


(34) 
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Inserting  the  perturbation  series  (30)  into  (29),  we  get: 


d  .du;  dwl.  1  d  ,du'  dw'  _d,dxr0  &u%s  _d,du;  dwZ, 
fa" )  +  7  v  ~aZ  S7~)  +  ^7  ( "«7  a7~)  a-,  ~~  a„)+ 


dtK  8 

+  u, 


dx '  dz 


e  drx  dz  dx 
d  ,du'0  dw'0 x  _ d  ,du'0  dwA  ,  d  ,duZ 


dx '  dz 
d  ,du‘ 


’dzK  dz 


dx 


rua  dw1  d  ,du'0  dw' 

°dx[  dz  dx>  °dz '  dz  dx] 


\ 'dx 
d~A~P0 


dx 

Ju 

Tz 


‘dzK  dz 


dx 


dw0  s  ,  ,  d  ,du0 

^+W°d~z^ 


dwc 

dx 


)+ 


dx  e"-1'  '  dx 
Terms  of  the  order  O(-)  gather  in  the  evolution  equation  for  qfQ : 


1.  ,  ,0Ap  _  ,  n9A/9i  dA pi 


dx 


dx 


dq’o  _  d  ,du'0  dw’0 ^  r_\dAPo 


(35) 


By  applying  the  averaging  operation  to  the  0(1)  ensamble,  terms  containing  products  of 
the  leading  order  and  prime  quantities  are  eliminated  and  the  evolution  equation  for  the 
mean  perturbation  vorticity  follows: 


Dq 

Dt 


dA Po 
dx 


,8Api .  .v 

+  (_ars-l(T)) 


u 


'aA+w'dA 


dx 


d i 


'-)■ 


(36) 


We  will  use  the  fact  that  averaging  is  a  linear  operator  so  that  from  the  continuity  equation 
(28)  to  obtain: 


dut o  dw0 
dx  dz 


and 


du'o  dw'0 
dx  dz 


(37) 


To  summarize,  we  have  arrive  at  two  separated  sets  of  equations  with  no  explicit  time 
dependence:  for  averaged  perturbation  variables  and  those  generated  by  the  forcing  of  our 
system.  By  analogy  to  the  Kapitza  pendulum,  a  stable  stratification  corresponds  to  the 
stable  equilibrium  of  the  pendulum,  while  the  unstable  stratification  -  to  the  inverted  state. 
Now  we  will  analyze  the  response  of  the  system  to  the  instantenous  density  disturbance  to 
determine  how  the  vertical  oscillations  of  the  gravity  field  influence  stability  properties  of 
the  system  as  a  whole. 


4.4  Stability  of  the  vibrating  Boussinesq  system 
4.4.1  Monochromatic  wave  disturbance  in  x  direction 

We  will  start  with  a  perturbation  of  the  mean  state  of  the  form: 

A  p  =  p0cos  (kx).  (38) 


Assuming  a  perturbation  strcamfunction  of  the  form  —  4^  cos(fc.x),  from  (35)  we  have 
uf0  —  0  and  wfQ  =  wfQ(x)  =  ^Ap^x  f  .  Therefore  in  the  (35),  the  advection  terms 

cancel  out  yielding  =  Swf0  and  they  vanish  in  the  equations  for  the  evolution  of  the 
mean  density  perturbation  (33)  the  mean  vorticity  perturbation  (36)  to  give: 


Dq  =  dAp£  smi  DAPo  _ 

Dt  dx  1  2  h  Dt 


(39) 
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Manipulation  of  these  expresions  yields  the  evolution  equation  for  the  mean  density  pertur¬ 
bation: 

2 _ 

—jrfif  +  S*AP =  °»  where  5*=,5+^y_>  (40) 

is  the  new  modified  nondimensional  frequency,  expressed  by  means  of  the  stability  parameter 
(note  that  <S2  =  1  irrespective  of  the  initial  stability).  Thus  we  conclude  that  in  case  of 
a  stable  initial  stratification  (S  =  1),  the  stability  is  augmented,  while  in  case  of  unstable 
initial  stratification  there  is  a  stabilizing  effect  of  the  vertically  oscillating  gravity  field.  The 
stability  condition  in  the  initially  unstable  case  is: 


2 


>  1, 


(41) 


which  is  identical  to  the  analogous  condition  for  the  inverted  Kapitza  pendulum  (14).  In 
fact,  the  equation  (40)  itself  may  be  perceived  as  a  linear  analog  of  (12)  -  unintentionally 
linearized  by  the  dynamics  itself.  We  can  thus  expect  all  the  results  obtained  from  the 
Mathieu  equation  for  the  inverted  pendulum  to  be  valid  in  the  problem  discussed  here. 

Assuming  the  plane  wave  solution  of  the  form  0  =  'ip0cos(kx  —  cut),  we  obtain  the 
dispersion  relation  for  the  internal  waves  supported  by  our  system: 


c J 


2 


which  is  equivalent  to  the  dispersion  relation  of  the  buoyancy  oscillation  typical  to  the 
unforced  Boussinesq  system,  with  N  replaced  by  S*,  i.e.  the  system  is  stabilized  and  the 
frequency  of  the  vertical  oscillation  is  higher  then  the  maximum  frequency  in  the  unforced 
case. 


4.4*2  Monochromatic  plane  wave  disturbance 

In  case  the  initial  density  perturbation  has  a  form  of  the  planar  wave: 

Ap  =  Qo  cos (kx  +  mz),  (42) 


the  procedure  is  conducted  analogously  as  in  the  previous  case.  Although  v!0  ^  0,  from 
the  continuity  equation  (37)  we  have  Uo^,  =  =  uo^  =  =  0  leading  to  the 

cancellation  of  the  advection  terms.  Consequently,  the  multiscale  technique  and  averaging- 
leads  to  the  following  stability  condition  for  the  averaged  system: 


d2  (  d2 

Dt2^dx2 


+ 


dz 2 


where  S 2 


5  I  (  —  )£& 


As  before,  we  conclude  that  in  case  of  a  stable  initial  stratification  the  stability  is  ampli¬ 
fied,  while  in  case  of  the  unstable  initial  stratification,  the  vibrating  gravity  results  in  the 
stabilizing  effect  whenever: 


( 


k 2 


k 2  +  m2 '  2 


)^>i, 


(43) 
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which  is  equivalent  to  (41),  except  that  now  the  orientation  of  the  perturbation  in  space 
(i.e.  matters.  The  dispersion  relation  in  this  case  is: 


kf 

k 2  +  m2 


(44) 


Again,  the  form  of  the  dispersion  relation  is  equivalent  to  the  dispersion  relation  of  the 
internal  gravity  wave  solutions  to  the  unforced  case,  with  N  replaced  by  5*.  The  effect 
of  the  vibrational  gravity  forcing  depends  not  only  on  the  values  of  the  parameters  of  the 
forcing,  but  also  on  the  angle  of  the  wavevector  k:  for  a  given  forcing  properties,  vertical 
disturbances  are  more  difficult  to  suppress. 


4.4.3  Perturbation  of  an  arbitrary  form 

If  we  allow  the  instantenous  density  disturbance  to  have  an  arbitrary  form  A p(x,  z),  the 
advective  terms  in  (33,  34)  and  (36)  do  not  cancel  out  and  the  mean  vorticity  equation 
takes  a  form  of  an  integro-differential  equation: 


Dq 

Dt 


dAp 

dx 


,dApi 

+  '  dx 


s_i)-j(fvV). 


(45) 


Without  further  assumptions,  it  is  difficult  to  construct  any  meaningful  stability  condition. 
However,  if  we  could  represent  A p  in  Fourier  series  and  linearize  in  uf0,w'0  and  A p,  we  can 
get  rid  of  the  advective  terms  and  obtain  the  following  form  of  the  mean  vorticity  equation: 


Dq 

Dt 


(46) 


which  shows  the  additive  effect  of  any  single  perturbation  wave  component  of  the  series  to 
the  baroclinic  generation  of  the  mean  vorticity,  modified  by  the  vibrating  gravity  in  similar 
way  as  in  the  previous  simpler  cases.  The  corresponding  stability  condition  for  an  unstable 
initial  stratification  is: 


p2n2 

2 


£( 


)>r 


(47) 


so  again  we  can  see  that  provided  the  instantenous  density  perturbation  can  be  given  a  form 
of  the  sum  of  waves,  there  is  an  analogy  between  the  effect  of  the  vibrational  forcing  on  the 
stability  of  the  system  in  case  of  the  Kapitza  pendulum  and  dynamics  of  a  fluid  described 
by  ideal  Boussincsq  equations,  with  a  modification  due  to  nonlocality  of  the  problem:  for  a 
given  values  of  forcing  parameters  the  stability  is  strongly  affected  by  the  direction  of  the 
propagation  of  the  density  disturbance. 


5  Summary  and  conclusions 

In  this  work  we  considered  the  inverted  pendulum  with  the  vibrating  support.  The  applica¬ 
tion  of  multiscale  perturbation,  leading  to  the  averaged  dynamics,  as  well  as  linearization, 
allowed  us  to  study  the  stabilization  phenomenon.  Then  we  used  the  multiscale  technique 
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and  averaging  to  an  incompressible,  inviscid,  linearly  stratified,  nonlinear  Boussinesq  system 
in  a  periodic  domain,  subjected  to  rapidly  oscillating  gravity  field.  We  have  shown  that, 
provided  the  instantenous  density  perturbation  can  be  given  a  form  of  a  wave  or  sum  of 
waves,  the  stabilization  mechanism  induced  by  the  vibrational  forcing  is  analogous  to  that 
exhibited  by  the  Kapitza  pendulum.  However,  the  dynamics  of  the  Boussinesq  system  is 
more  complicated,  as  it  evolves  not  only  in  time,  but  also  in  space.  The  resulting  stability 
condition  for  the  initially  unstable  configuration  is  modified:  it  requires  not  only  suitably 
high  frequency  and  small  amplitude  of  the  vibrating  motion  -  the  direction  of  propagation 
of  the  density  perturbation  in  the  space  also  plays  a  role.  In  case  of  a  disturbance  of  a  gen¬ 
eral  form  it  is  difficult  to  draw  the  conclusions  about  the  system  stability  without  further 
assumptions. 

The  work  presented  here  is  not  just  an  idealized,  educative  example  that  contributes  to 
the  understanding  of  the  instability  phenomena.  There  are  indeed  real  physical  situations 
that  permits  the  use  of  Boussinesq  approximation  with  the  forcing  regime  as  prescribed 
here,  though  certainly  requiring  adequate  boundary  conditions  and  generally  more  complex 
analysis.  As  an  example,  we  give  convective  phenomena  in  radially  pulsating  stars,  treated 
in  ([14])  by  means  of  linearization.  The  appeal  of  the  multiscale  perturbation  and  averaging 
methods  discussed  here  is  that  they  provide  the  description  of  the  global  behavior  of  the 
averaged  variable,  expected  to  be  the  one  related  to  any  observed  quantity.  This  observation 
strongly  encourages  the  application  of  these  methods  in  any  future  investigation  of  the 
stability  mechanisms  in  more  realistic  and  complex  fluid  systems  forced  parametrically. 
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7  Appendix  A 

In  the  coordinate  system  (2),  the  kinetic  energy  of  the  system  is  expressed  as: 

T  =  i?n(x2  +  y2)  =  ~ m(l2(p  -f  a272  sin2(7i)  +  2a/7sin(7t)  sin  (fxp)  = 

=  \ml24)  -f  map2  cos(7 1)  cos  (j)  +  \ma2 72  sin2(7^)  —  Unapsin^t)  cos(<£)l , 

2  2  dt 

and  the  pot  ential  is  U  =  —nig0y  =  —mg0l  cos0.  The  Lagrangian  of  the  system  is  therefore: 

L  =  T  —  U  =  —  77/./2<^>  +  map2  cos(7?,)  cos  <p  -f  mg0l  cos  <f  +  [777.0/7 sin (7^)  cos ((f))  +  ^may  sin2 (7^)] . 
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The  complete  time  derivative  on  RHS  does  not  enter  the  action,  so  from  the  Lagrange  equation: 

d  fdL\  dL  „ 


dt\d<j)J  d(j) 

we  obtain  the  equation  of  motion  of  the  vibrationally  forced  pendulum  (3): 

rrd2(j)  +  mg0l  sin  <j>  4-  maly 2  cos(7t)  sin  (j)  =  0. 

8  Appendix  B 

Inserting  the  perturbation  series  (9)  into  (8)  gives: 

d^d)  (3  -  ~ 

— ^  =  —  sin  cj)0  —  t(j) i  cos  cos  <f>0  -  —  cos  r  sin  (j>0  —  (3(j)\  cos  r  cos  <j)0  —  cos  r  cos  <p0 

dt z  € 

A  typical  initial  condition  we  can  think  about  is  the  initial  displacement  from  the  inverted  position 
with  zero  angular  velocity: 


4>o\t=o  = 

t= 0 


d<f>0 1  .  I  d(f)0  . 

-T-U=0  H - t=0  =  U. 

dt  r=0  €  ar  r=0 


And  the  absence  of  the  perturbed  quantities  (no  fast  initialization): 

i  |  d(j)n .  1  d(pn 

0n  *=o  =  1,  “rrU=o  H - —  t=o  =  0,  n  >  0. 

r= o  dt  t— o  €  dr  r=o 

By  assembling  powers  of  e  one  obtains  equations  for  the  subsequent  orders: 

O(i)  :  = -/3  cost  sin 

0(1)  :  sin <pa  cos t cos 


=  —  (3  cost  sin  <£o(£), 


Let’s  look  at  them  in  detail.  The  integral  of  the  first  of  them  is: 

/r\t  ^  \  „  ^  4*0  n  /  _ rp  i 


°<?> 


=  0,  4>o  =  F0(t)r +  Go(t). 


This  gives  unbounded,  linear  growth  of  the  solution  in  fast  time  r  which  obviously  destroys  the 
solution  on  long  time  scales.  Such  terms  are  called  secular  terms  and  a  standard  procedure  in 
multiscale  perturbation  technique  is  to  remove  them  by  making  them  equal  to  zero  and  vanish  ([9]). 
This  argument  can  be  justified  by  using  the  initial  conditions  (48),  from  which  we  get  F0(t)  —  0  and 
G0(t)  =  A0.  Therefore,  the  leading  order  solution  is  equal  to  A0(t),  a  function  of  long  time  t  but  a 
constant  with  respect  to  r,  <f)0  =  $G(t).  Using  analogous  argument  with  the  initial  conditions  (49) 
incorporated,  we  get  for  the  next  order  term: 

O(^)  :  (f>i  =  /3(cost  —  1)  sin  <j)0(t).  (50) 

The  equation  for  the  order  0(  1)  becomes: 


0(1): 


2(3 sin  r  cos  4>04>o{t)  +  /52  cos  r  sin  <f)0  cos  <£0  — —  cos  2 r  sin  (j)0  cos  cf)0 


-q-F.  _  Sin  _  -p*  Sin  0o(t)  cos  4>0{t). 
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Terms  that  are  constants  with  respect  to  the  fast  time  r  are  a  potential  source  for  a  secular  growth 
in  our  solution.  The  condition  for  the  solution  to  be  valid  uniformly  on  t  gives  the  equation  of 
motion  of  the  leading  order  quantity  (10).  Note  also  that  by  taking  the  average  defined  by  (11)  of 
the  equation  (50),  one  gets  (12). 
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1  Introduction 

The  calculation  reported  in  this  paper  is  a  standard  one  for  those  who  study  pattern  forma¬ 
tion  in  nonlinear  systems.  Pattern  formation  is  prevalent  when  instability  breaks  symme¬ 
tries  present  in  a  basic  state.  In  the  canonical  example,  stationary  fluid  heated  from  below 
gives  way  to  patterns  through  Rayleigh-Benard  instability.  For  a  wide  class  of  problems, 
one  may  derive  amplitude  equations  that  govern  the  weakly  nonlinear  development  of  an 
instability.  Amplitude  equations  describe  the  slow  modulation  in  space  and  time  of  distur¬ 
bances  excited  near  the  threshold  of  an  instability.  The  form  of  the  amplitude  equations  is 
determined  generically  by  symmetries  of  the  governing  equations  and  the  structure  of  the 
linear  instability  ([5],  [3]). 

When  a  large,  dissipative,  system  undergoes  a  Hopf  bifurcation,  with  a  trivial  steady 
state  losing  stability  to  a  growing,  unsteady  wave  pattern,  an  amplitude  equation  that 
generically  arises  is  the  complex  Ginzburg-Landau  equation  (CGL) 


8A 

at 


—  [xA  +  v 


d\A 
dx 2 


-(A\At 


(1) 


The  function  A  =  A(x,t)  is  a  complex  valued  function  of  two  real  variables.  It  represents 
the  slowly  varying  amplitude  envelope  of  packets  of  waves  generated  by  the  instability.  The 
variable  x  is  spatial  displacement  in  a  frame  moving  at  the  group  velocity  of  the  unstable 
wavepackets.  The  coefficients  /j,ia  and  £  are  complex,  with  Re{ju}  >  0,  Re{i/}  >  0.  When 
Me  {C}  >  0)  the  cubic  nonlinearity  will  balance  the  linear  growth  term  and  halt  the  growth 
of  the  disturbance  when  \A\  becomes  large  enough.  This  case  is  a  supercritical  bifurcation. 
If  Re{£}  <  0,  the  cubic  term  will  never  balance  the  linear  growth  term  and  the  equation 
predicts  growth  without  bound.  In  such  cases,  the  bifurcation  is  subcritical  and  equation 
(1)  is  not  a  good  asymptotic  description  of  the  nonlinear  dynamics.  Higher  order  terms 
must  be  calculated  that  balance  the  exponential  growth.  Whereas  /i  and  v  can  be  predicted 
from  a  knowledge  of  the  linear  theory  alone,  £  and  the  all  important  sign  of  Re  {£}  cannot 
be  predicted  without  a  nonlinear  theory,  and  must  be  determined  case  by  case  by  direct 
calculation. 

To  understand  the  origin  of  equation  (1),  consider  the  situations  depicted  in  figure  6. 
When  a  system  is  weakly  unstable,  the  unstable  modes  grow  exponentially,  but  very  slowly. 
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Most  of  the  stable  modes,  on  the  other  hand  are  relatively  strongly  damped.  Because  the 
stable  modes  decay  rapidly,  they  are  present  in  the  system  only  to  the  extent  that  they 
are  forced  by  nonlinear  interactions  with  the  unstable  modes.  Their  evolution  is  slaved  to 
that  of  the  unstable  modes.  In  many  cases,  Ginzburg-Landau  type  amplitude  equations 
emerge  from  asymptotic  methods  that  exploit  the  timescale  separation  between  the  stable 
and  unstable  modes. 

Equation  (1)  governs  only  systems  that  are  sufficiently  large  in  the  following  sense.  The 
spatial  variations  of  the  unstable  wave  packets  must  be  produced  by  the  interaction  of  a 
large  number  of  closely  spaced  unstable  modes.  In  a  system  of  finite  size,  the  spectrum 
of  available  wavenumbers  must  be  discrete  to  satisfy  boundary  conditions.  The  smaller 
the  system  becomes,  the  larger  the  separation  between  neighboring  modes  in  wavenumber 
space.  If  the  modes  are  widely  spaced,  then  a  weakly  unstable  state  may  consist  of  only  one 
or  a  small  number  of  weakly  unstable  modes  with  all  others  relatively  strongly  damped,  as 
shown  in  figure  6b.  The  amplitude  equations  governing  such  a  situation  would  be  a  finite 
system  of  real,  ordinary  differential  equations  in  time  for  the  amplitudes  of  the  unstable 
modes  [2].  However,  if  the  system  is  infinitely  large,  then  the  wavenumber  spectrum  is 
continuous.  In  that  case,  when  the  system  is  weakly  unstable,  a  narrow  band  containing  an 
infinite  number  of  modes  becomes  unstable  as  shown  in  figure  6a.  The  nonlinear  interaction 
of  an  infinite  number  of  slowly  evolving  unstable  modes  leads  to  amplitude  PDEs  such  as 
equation  (1),  rather  than  amplitude  ODEs.  Even  when  a  system  is  finite,  if  it  is  sufficiently 
large  that  weak  instability  leads  to  the  nonlinear  interaction  of  a  large  number  of  closely 
spaced  unstable  inodes,  then  equation  (1)  is  still  the  appropriate  asymptotic  description  of 
the  evolution  of  the  instability. 

Though  the  form  of  equation  (1)  can  be  guessed  a  priori  from  considerations  of  sym¬ 
metry  [5],  one  must  calculate  the  equation  in  detail  in  order  to  discover  an  expression 
for  £.  Knowing  the  coefficients  is  useful,  and  not  only  because  the  sign  of  Re{£}  deter¬ 
mines  whether  the  Hopf  bifurcation  is  supercritical  or  sub  critical.  Solutions  of  the  complex 
Ginzburg-Landau  equation  exhibit  a  rich  variety  of  different  qualitative  behaviors  as  the 
coefficients  are  varied.  In  large  regions  of  parameter  space,  spatiotemporal  chaos,  inter- 
mittcncy,  or  the  spontaneous  formation  of  coherent  structures  may  be  observed.  In  others 
regions,  stable,  monochromatic  plane  wave  solutions  dominate.  By  computing  the  coeffi¬ 
cients  in  terms  of  physical  variables,  it  is  possible  to  determine  which  of  these  behaviors  are 
characteristic  of  the  real  physical  system. 

In  this  paper,  we  derive  a  complex  Ginzburg-Landau  equation  for  baroclinic  instability. 
Baroclinic  instability  is  important  in  the  study  of  the  atmosphere  and  oceans.  It  is  the 
mechanism  that  generates  weather  systems  in  the  midlatitude  atmosphere,  and  it  gener¬ 
ates  eddies  in  the  oceans  that  are  responsible  for  a  great  deal  of  heat  transport  from  the 
equator  to  the  poles.  Baroclinic  instability  occurs  when  vertical  shear  flows  driven  by  hor¬ 
izontal  temperature  gradients  in  a  rotating  domain  become  unstable,  and  large,  wavelike 
disturbances  develop  that  redistribute  temperature  fields  in  a  kind  of  horizontally  slanted 
convection. 

A  number  of  models  have  been  used  to  study  this  phenomenon,  the  most  well  known  of 
which  are  the  Charney  model  and  the  Eady  model.  A  basic  introduction  to  these  models 
and  others  can  be  found  in  the  textbooks  by  Pedlosky  [9],  and  Gill  [4].  In  this  analysis,  we 
use  perhaps  the  simplest  model  exhibiting  baroclinic  instability.  Introduced  by  Phillips  in 
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1954  [10],  it  consists  of  a  two-layer  quasi-geostrophic  flow  in  a  rotating  channel  as  shown 
in  figure  1.  Phillips  analyzed  the  linear  stability  of  a  shear  flow  in  which  the  fluid  in  each 
layer  moves  with  a  uniform  zonal  velocity.  The  basic  state  differs  from  that  of  the  standard 
Kelvin-Helmholtz  instability  because  rotation  forces  a  slanting  of  the  free  surface  between 
the  two  layers  in  order  to  balance  the  Coriolis  force  on  the  zonal  flow.  Phillips  found  that 
instability  occurs  when  the  difference  between  the  velocities  of  the  two  layers  exceeds  a 
critical  threshold.  The  model  can  easily  be  modified  to  include  important  physical  effects, 
such  as  dissipation  or  a  planetary  vorticity  gradient  /?. 

The  present  work  is  motivated  by  a  series  of  papers  by  Pedlosky  ([6],  [7],  and  a  paper  by 
Romea  [11]  that  analyzed  the  nonlinear  development  of  baroclinic  instability  in  the  Phillips 
model  in  a  number  of  physically  interesting  situations.  Pedlosky’s  papers,  in  particular, 
were  the  first  to  use  multiscale  asymptotic  methods  to  compute  amplitude  equations  for 
baroclinic  instability.  In  contrast  to  the  present  effort,  Pedlosky  and  Romea  used  periodic 
zonal  boundary  conditions  and  were  therefore  investigating  the  nonlinear  interaction  of  a 
discrete  spectrum  of  unstable  modes.  Consequently,  their  calculation  led  to  amplitude  ODEs 
as  described  above.  Periodic  boundary  conditions  are  physically  motivated  for  a  model  of 
atmospheric  dynamics,  because  the  midlatitude  /3-plane  is  typically  conceived  as  a  periodic 
strip  wrapping  around  the  earth.  A  typical  wavelength  for  a  baroclinic  disturbance  in  the 
atmosphere  is  about  2000  km,  which  leaves  space  for  only  ten  to  fifteen  wave  periods  in  a 
complete  traversal  of  the  globe  at  midlatitudes.  Nonetheless,  there  are  physical  examples 
of  baroclinic  instability  to  which  the  large  aspect  ratio  approximation  is  applicable.  For 
example,  baroclinic  instability  produces  eddies  in  ocean  currents  on  the  scale  of  200  km. 
In  an  ocean  measuring  several  thousand  kilometers  across,  there  is  plenty  of  room  for  large 
scale  structures  to  emerge.  Our  analysis  of  the  large  aspect  ratio  Phillips  model  should 
provide  some  insight  into  the  kinds  of  structures  one  might  expect  in  these  situations. 

To  relate  the  CGL  derived  here  to  preexisting  analysis  of  the  qualitative  behaviors  of 
solutions  of  the  CGL,  we  make  use  of  two  studies  by  Shraiman  et  al  [12]  and  Chate  [1]. 
These  papers  present  a  fairly  exhaustive  numerical  study  of  the  parameter  space  of  the  one 
dimensional  CGL.  By  mapping  the  coefficients  calculated  here  onto  the  coefficients  used 
in  those  studies,  we  determine  what  region  of  parameter  space  is  inhabited  by  baroclinic 
instability.  We  find  that  most  of  the  physical  parameter  space  maps  onto  a  region  of  CGL 
parameter  space  where  non-chaotic,  stable,  monochromatic  waves  are  the  dominant  solution 
at  long  times.  Intermittent  behavior  may  be  possible  when  the  /3  effect  is  strong  compared  to 
dissipation,  but  this  has  not  been  confirmed  either  analytically  or  by  numerical  simulation. 

In  §2,  we  give  a  detailed  description  of  the  Phillips  Model,  the  physical  variables  in¬ 
volved,  and  the  scaling  limits  underpinning  its  derivation.  The  physical  situations  exam¬ 
ined  by  Pedlosky  in  [6],  [7],  and  [8]  are  then  described  and  contrasted  with  the  situation 
considered  here.  In  §3,  the  linear  theory  of  baroclinic  instability  is  outlined,  and  the  CGL 
for  baroclinic  instability  is  derived  in  detail.  In  §4,  we  use  the  calculated  coefficients  to  map 
realistic  physical  values  of  the  variables  onto  Shraiman  et  al  and  Chate’s  parameter  regime, 
thus  giving  a  preliminary  prediction  of  the  structures  that  may  be  observable  in  baroclinic 
instability 
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2  Description  of  the  Phillips  Model 


The  physical  picture  underlying  the  Phillips  Model  is  given  in  figure  1.  Two  layers  of  fluid 
with  different  constant  densities  p\  <  p2  lie  in  an  infinitely  long  channel  of  finite  width  L 
and  height  D .  The  thickness  of  the  lower  layer  is  given  by  h(x,  y).  When  undisturbed,  each 
layer  has  thickness  D/2.  The  fluid  is  bounded  above  and  below  by  rigid  horizontal  planes* 
The  z-axis  is  oriented  along  the  channel,  the  y- axis  is  oriented  across  the  channel,  and  the 
£-axis  points  upward.  The  velocities  iq,  v\,  and  w\  are  the  upper  layer  fluid  velocities  in 
the  x,  y,  and  z  directions  respectively.  The  lower  layer  velocities  are  likewise  called  u 2,  ^2, 
and  u)2 .  The  pressures  are  given  by  p\  and  p2*  Each  layer  has  viscosity  v .  The  gravitational 
acceleration  is  g ,  and  the  entire  channel  rotates  with  angular  velocity  To  include  the 
effect  of  the  earth’s  sphericity,  the  rotation  rate  is  assumed  to  vary  linearly  with  y, 

n  =  \(fo  +  0v)- 


Figure  1:  Physical  picture  of  two  layer  channel  model. 

The  Phillips  model  is  derived  as  a  scaling  limit  of  the  Navier-Stokes  equations  for  flow 
in  the  channel.  The  derivation  is  given  in  greater  detail  in  §2  of  [6],  but  we  will  cover  the 
salient  points  of  the  derivation  here.  Dimensionless  equations  of  motion  are  obtained  by 
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scaling  the  dimensional  variables  as  follows 


{x'  iy')  —  it  »  =  b 


(V  ,/  'l  —  /  _  L.Wa  h'  —  (h  —  D/ 21  ZELUo 
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Pi  ~  Pitl/oL  • 


„/  _  P2+P2g(^-£>/2)-pigD/2 

P2  -  P2V fpL 


where  U  is  a  characteristic  scale  for  the  horizontal  velocities.  On  dropping  primes  we  find 


Dimensionless  Parameter 

Name 

Size 

e  =  U/(f0L ), 

Rossby  number 

« 1 

E  =  2v/f0D2, 

Ekman  number 

~  e2 

p  _  2/q  L2 

g’D 

rotational  Froude  number 

0(1) 

S  =  D/L, 

cross  section  aspect  ratio 

<  1 

0  =  0L2/U 

planetary  vorticity  factor 

0(1) 

Table  1:  Dimensionless  parameters  appearing  in  derivation  of  Phillips  Model, 
that  the  dimensionless  equations  are 
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where 


V2-  — +  ^2 
V<5 


a2  c>2 

fix2  oh/2 


It  is  assumed  that  <5  <  1,  so  that  horizontal  straining  contributes  negligibly  to  viscous 
dissipation.  The  kinematic  condition  at  the  interface  between  the  two  layers  is 


eF 

~2 


dh  dh  dh 

TTT  T  un~Z.  h  vn~K~  i 
at  dx  dy  \ 


=  wn,  at  z  =  bl  +  eFh) . 


(6) 


And  no  normal  flow  at  the  channel  walls  requires 

Vn  =  0,  at  y  =  0, 1.  (7) 

Several  dimensionless  parameters  have  appeared,  all  of  which  are  defined  in  table  1. 
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The  parameter  F  is  a  dimensionless  measure  of  the  width  of  the  channel.  Specifically, 
it  compares  the  channel  width  to  the  distance  a  linear  gravity  wave  on  the  interface  can 
travel  during  a  rotation  period.  The  maximum  speed  of  these  waves  is  cq  —  \/g(D/ 2,  where 
gf  =  g  ( p2  —  p\)  / P2  is  the  reduced  gravity.  The  rotational  period  is  T  =  27t//q.  Therefore, 


F  = 


—  47 r2 


The  Rossby  number  e  measures  the  relative  importance  of  inertial  forces  and  Coriolis  forces. 
We  will  assume  e  4C  1,  so  that  the  influence  of  rotation  will  be  very  strong.  The  Ekman 
number  E  measures  the  ratio  of  viscous  forces  to  Coriolis  forces,  and  determines  the  thick¬ 
ness  of  boundary  layers  in  which  viscosity  plays  an  important  role.  We  set  E  <C  1  with 
i?1/2 /e  =  0(1).  These  limits  are  exploited  by  introducing  asymptotic  expansions  of  all  the 
dimensionless  variables  in  powers  of  e,  such  as 
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Then,  to  leading  order,  the  flow  in  both  layers  is  in  geostrophic.  and  hydrostatic  balance. 
That  is, 
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The  leading  order  flow  is  also  horizontally  nondivergent, 


du^  dvn ^ 
dx  dy 


which  niotivat.es  the  introduction  of  layer  stream  functions  such  that 
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„(0) 
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^  x  V  . 


By  hydrostatic  and  geostrophic  balance,  the  stream  function  is  proportional  to  both  the 
pressure  fluctuation  and  the  height  of  the  interface  disturbance. 

Additionally,  viscous  forces  are  significant  only  in  thin  boundary  layers  near  the  top  and 
bottom  of  the  channel.  Nevertheless,  viscosity  plays  an  important  role  in  the  dynamics  of 
the  bulk.  Carefully  considering  the  dynamics  of  the  boundary  layers,  one  finds  that  vorticity 
in  the  bulk  forces  forces  fluid  to  emerge  from  those  layers  with  weak  vertical  velocities.  This 
phenomenon  is  known  as  Ekman  pumping.  These  vertical  velocities  act  to  damp  vorticity 
in  the  bulk  through  vortex  stretching.  From  here,  one  may  derive  the  evolution  equations 
for  ?/;n  by  manipulating  the  vertical  vorticity  equation  to  obtain 

{l  +  d^iy-d^T^^'+F(^-'P')+^  -  -rVVl  (8) 

(l  +  ^|-|rl)(v2'"2+FW'>-«+«  -  <9> 


where  r  =  E^^e  =  0(1).  In  these  equations,  fast  gravity  waves  have  been  filtered  out  and 
only  the  slow,  vortical  dynamics  remain. 
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To  investigate  baroclinic  instability,  we  write  down  evolution  equations  for  perturbations 
from  a  shear  solution  where  the  velocity  of  the  upper  layer  is  U  and  the  velocity  of  the  lower 
layer  is  —U.  Let 

V’l  =  -Uy  +  ip[,  ip2  =  Uy  +  i>'2. 

and  substitute  into  equations  (8)  and  (9).  Dropping  primes,  we  find 


{§i  +  Ui)qi  +  lt{0  +  2FU)  +  rV2'1’1  =  (10) 

(^~ui)q2+^{/3~2FU)+rS/2i)2  =  (n) 

qi  =  V2V>i  +  F  (if)2  —  V’l)  >  (12) 
q2  =  V2^2  +  F  (ipi  -  ip 2)  •  (13) 


with  boundary  conditions 


&Pn 

dx 


=  0,  y  =  0,1. 


These  equations  are  the  Phillips  model  of  baroclinic  instability. 

Pedlosky  and  Romea’s  papers  worked  with  these  equations  for  a  the  channel  periodic 
in  x.  In  [6],  Pedlosky  derived  amplitude  ODEs  for  the  purely  viscous  case  (3  =  0  and 
r  =  0(1),  and  the  inviscid  cases  with  r  —  0  and  j3  =  0(1)  or  f3  =  0.  Pedlosky  obtained 
a  Ginzburg-Landau  ODE  with  real  coefficients  for  (3  =  0,  r  =  0(1).  However,  the  small 
viscosity  cases  do  not  lead  to  Ginzburg-Landau  type  amplitude  equations,  because  there  is 
no  scale  separation  between  the  decay  rate  of  the  stable  modes  and  the  growth  rate  of  the 
unstable  modes.  Also,  the  case  of  0  <  r  1  is  a  singular  limit  of  the  linear  theory.  The 
introduction  of  an  infinitesimal  viscosity  actually  destabilizes  the  flow,  reducing  the  critical 
value  of  U  by  an  0(1)  amount.  Pedlosky  derived  amplitude  equations  for  this  subtle  case 
with  (3  =  0  in  [7].  In  [11],  Romea  tackled  the  small  r  case  with  /?  =  0(1). 

It  seems,  however,  that  the  case  with  both  (3  =  0(1)  and  r  =  0(1)  has  never  been 
addressed.  It  is  interesting  to  know  how  these  two  effects  compete  when  they  are  of  compa¬ 
rable  strength.  The  infinite-size  limit,  which  introduces  the  possibility  of  spatio-temporal 
disorder  and  localized  structures,  has  also  never  been  investigated. 


3  Derivation  of  CGL 


The  Phillips  model  equations  (10)-(13)  may  be  written  in  the  form 


Wtm 

d 9 


dx 


-  J($,  Mtt), 
0,  for  y  =  0, 1. 


(14) 

(15) 


231 


where 


<L>  = 
M  = 

L  = 

and 


Vh 

ip2 

V2-F  F 

JP  ^2  _  p 

-u£  (V2  -F) -(13  +  2 FU)  §-x  -  rV2 

f/pl 
ur  dx 


U&  (V2  -  F)  -  09  -  2FU)  £  -  rV2 


T ,  4  dA  dB  dA  dB 

^  dx*  dy  dy*  dx' 


The  *  operator  is  termwise  multiplication  of  vectors  without  summing.  That  is 


=  f aibi 

\  «2  /  \  h  J  \  a,2b2 


3.1  Linear  Theory 

Much  of  the  st  ructure  of  the  finite  amplitude  evolution  equations  is  determined  by  the  linear 
instability  properties  of  the  system.  The  linearized  equations  are 

d 

-MT  =  LT, 
dt 

<9T 

—  =  0,  for  y  =  0,1.  (16) 

One  may  seek  normal  mode  solutions  of  the  form 

T  (x,  y ,  t)  =  Me  {  T  {k,  m,  U)e  <(**+"»»-«•<)  J  . 

Substitution  of  this  form  into  equation  (16)  yields  a  system  of  algebraic  equations 


([  +  iwM)  <F  =  0 


(17) 


where 
M(Aq  m) 

L(k,  m,  U) 


-(k2  +  m2  +  F)  F 

F  —  ( k 2  +  m2  +  F) 

U ik  ( k 2  +  77? 2  +  F)  +  •  •  • 

+r  (k2  +  m2)  -  ik  (0  +  2 FU) 

ikUF 


—ikUF 

—ikU  (k2  +  m2  +  F ) - 

-ik  (0  -  2 FU)  +  r  ( k 2  +  m2 


For  a  null  vector  T  to  exist. 


det  (L  +  itulVl)  =  0 


(18) 
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Figure  2:  Growth  rate  curves  for  m  =  n ,  2n ,  3n ,  4n ,  calculated  for  the  critical  shear  U  Uc 
at  which  a  single  mode  k  =  kc  is  marginally  stable. 


must  hold.  Equation  (18)  implies  that  u  must  be  a  root  of  a  second  order  polynomial  with 
coefficients  that  are  functions  of  k,  m,  and  the  physical  parameters  of  the  system.  Thus, 
for  fixed  values  of  these  arguments  there  are  at  most  two  distinct  values  of  u>  satisfying 
equation  (18).  This  condition  defines  the  dispersion  relationships 


iO 


Q  (k,  m,  U,  d ) 


(K2  +  F)  \[&KA  (K4  -  4i?2) U2  +  F2  +  irK2)2, 

l K 2  +  2 F)  +  if2  (iiT2  +  2 F) 


where  d  =  1,2,  and  K 2  =  k2  +  m2.  A  mode  is  stable  if  <  0  and  unstable  if 

lm{n}>0.  .  ,  . 

Because  of  the  boundary  condition  (15),  normal  mode  solutions  exist  only  tor  m  - 
7r)  2tt,  3tt,  . . .  when  k  ±  0.  However,  because  the  channel  is  infinite  in  the  x  direction  there  is 
a  continuous  spectrum  of  solutions  in  k.  Plotting  Im  {G}  for  admissible  k  and  m  produces 
a  discrete  set  of  growth  rate  curves,  as  shown  in  figure  (2). 

When  all  the  growth  rate  curves  lie  in  Im  {fl}  <  0,  the  system  is  stable  to  infinites¬ 
imal  perturbations.  For  fixed  (fc,m),  one  may  compute  the  critical  value  of  the  shear 
U  =  Umaxg(k,  tyi)  at  which  that  particular  mode  becomes  marginally  stable  by  setting 
Im  {f U,d )}  =  0  and  solving  for  U.  One  may  use  the  identity 

Re  j\/a  +  /h  j  =  sgn(o)  a2  +  &2  +  ^ 
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and  perform  some  lengthy  algebra,  the  result  of  which  is 


Unvargik)  Wl) 


N 


{if  K4  (K2  +  F)2  +  F2(32 
K2  ( K 2  +  F)2  (2 F  -  K2) 


(20) 


The  mode  is  stable  for  U  <  £7marg(/c,  m).  Therefore,  the  entire  system  is  stable  for 


U  <UC=  inf  Uma.rg{k,m)  (21) 

0<K2<2F 
m=m r 

It  is  worth  noting  that  equation  (20)  implies  that  instability  is  impossible  for  F  <  7t2/2. 
Furthermore,  one  can  show  that  t/ma rg  increases  monotonically  with  ra,  so  that  the  first 
modes  to  become  marginally  stable  as  U  is  increased  will  lie  on  the  m  =  7r  curve. 


3.2  Nonlinear  modulation 


At  U  =  £/c,  a  single  mode  (A:c,  n)  has  Im  {f2  (kCl  7r) }  =0,  and  all  other  modes  are  stable.  The 
purpose  of  this  work  is  to  learn  what  happens  when  the  shear  is  increased  slightly  above  Uc 
and  the  system  becomes  weakly  unstable.  As  the  modes  grow,  their  nonlinear  interactions 
play  a  pivotal  role  in  the  subsequent  evolution  of  the  instability. 

Figure  (6)  visually  summarizes  the  insight  into  this  situation  that  linear  theory  provides. 
If  U  =  Uc  +  A,  with  |A|  1,  then  the  maximum  growth  rate  is  positive  and  O  (|A|).  The 

growth  rate  curve  is  well  described  by  a  parabolic  function  in  the  neighborhood  of  its 
maximum,  so  there  is  a  band  of  unstable  wavenumbers  of  width  O  (jA|1//2^  around  k  —  kc. 

Any  exact  solution  to  (14),  (15)  can  be  written  in  the  form 

OO  /‘OO 

*(x,y,t)  =  (22) 

n=l 


Here,  ^(/cym,  U)  is  the  mode  eigenvector  defined  by  equation  (17),  and  A^%m(t)  tracks  the 
time  evolution  of  the  amplitude  of  mode  (fc,m).  The  amplitudes  of  the  unstable  modes 
will  dominate  because,  as  mentioned  above,  strongly  stable  modes  decay  rapidly  and  are 
present  due  only  to  nonlinear  forcing  by  the  slowly  evolving  stable  modes.  That  is,  Ak,  m 
should  be  strongly  peaked  near  k  =  kc  on  the  m  =  n  branch. 

Using  this  assumption  to  write  an  approximate  form  for  the  Fourier  series-transform 
solution  (22)  will  motivate  scalings  for  an  asymptotic  solution  of  this  problem.  First,  ne¬ 
glecting  the  summands  for  n  ^  1  and  changing  the  integration  variable  to  center  on  k  —  kc 
gives 


/•OO 

Akc+k',n(t)i>(kc  +  k1, 7r)e  (i((fcc+fc')*+n/-n(fcc+fc',ir.t/e+A)0)dfe/ 

-OO 


Now,  we  make  use  of  the  fact  that  the  dominant  contribution  to  the  integral  comes  from 
kf  =  O  ^A]1/2^.  Let  k  ~  \A\~}/2  kf  and  Taylor  expand  the  integrand  about  kc.  One  finds: 


ty(x ,y,t)  w  i>(k ;c,ir)cMcx+*v-SHke,«,ue)i)  /*  i-(i)ei|A|,/afc(x-^l*=fccO“iArol 

J—oc 


-Vc'dk 
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where  Aj.  —  | A | 1//2  Ak  +|^p/2^(t)-  This  expression  is  the  product  of  the  marginally  stable 
mode  at  criticality  with  an  amplitude  envelope  slowly  varying  in  space  and  time: 


V(x,y,t)&A( |A|1/2  (x 


-  Cgt)  ,  I A  [  f)  7T,  t/c)e  ^(fccrr+Try-net)) 


(23) 


Here, 


(24) 


is  the  group  velocity  of  the  marginally  unstable  mode.  We  also  write  Qc  =  fl(kCl  7r,  Uc )  and 
i>c=  4>(kc,7T,Uc). 

Note  that  there  is  only  one  marginal  wave  at  criticality,  and  not  a  pair  of  waves  traveling 
in  opposite  directions.  This  fact  can  be  established  by  noting  that,  though  the  critical  branch 
satisfies  Em  {£1}  =  0  for  both  kc  and  —  kc,  we  have  Me  {Q  (— fcc,  7r,  1)}  =  —  Me  {Q  (kC)  7r,  1)}. 
This  implies  that  the  critical  modes  (/cc,7r)  and  (— /cc,7r)  differ  by  only  a  phase  shift.  The 
second  branch  associated  with  (fcc,7r)  is  strongly  damped.  This  loss  of  symmetry  is  due  to 
the  /3-effect,  which  imposes  a  directionality  to  the  propagation  of  waves  supported  by  the 
planetary  vorticity  gradient. 


3.3  Method  of  Multiple  Scales 

The  considerations  leading  to  equation  (23)  reveal  the  proper  scalings  to  use  in  a  multiple- 
scales  approach  to  this  problem.  Let 

U  =  UC  +  A,  T=|A|1/2i,  T=|A|f,  X  =  |A|1/2a:. 

Then,  seek  a  solution  for  of  the  form 

#  =  |A|  1/2*W(*,z,y,T,T,X)  +  [A  |  ¥2\t,x,y,T,r,  X)  +  |A|3/2  ^\t,x,y,T,r,  X)  +  ••• 


The  introduction  of  new  space  and  time  scales  requires,  by  the  chain  rule, 


8_ 

8t 

d_ 

dx 


|+iArA+iM|, 


V2  +  2  1 A  |1/2 


d2 


dxdX 


+  |A| 


d 2 

dX 2' 


Substituting  these  transformations  into  the  system  (14),  (15)  and  collecting  terms  of  like 
order  in  |A|,  one  may  obtain  a  hierarchy  of  inhomogeneous  linear  problems.  The  most 
straightforward  way  of  doing  this  requires  the  explicit  manipulation  of  the  terms  of  M 
and  L.  Unfortunately,  proceeding  in  this  manner  produces  extremely  messy  algebra  and 
complicated  expressions  that  are  difficult  to  interpret  in  terms  of  physical  properties  of 
the  system.  However,  it  is  possible  to  develop  the  expansion  in  general  terms  without 
considering  the  detailed  structure  of  the  operators  M  and  L.  By  keeping  track  of  only  the 
formal  structure  of  the  expansion,  we  will  be  able  to  make  useful  simplifications  throughout 
the  computation. 
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When  the  linear  operators  M  and  L  are  applied  to  slowly  varying  wave  packets,  they 
can  be  expanded  in  the  following  way.  First,  note  that 

M  (dx  +  |A|  1/2dx,<9y)  A{X,T,T)4>{k,m)ei{kx+my)  =  M  (k.  -  i  |A|1/2  AVei{kx+my) . 

Formally,  we  may  treat  dx  as  a  variable  and  Taylor  expand  M  to  find 


M 


(dx  +  \&\l/2dXidv)A*ei 


( kx+my )  _ 


-  i  |  A|1/2  m) 


dM 


d 


dkv'  dX 


I A I  d2  M  .  d 2 
2  dk 2  (k,m^dX2  + 


A  4>e^kx+myK 


Thus,  the  formal  expansion  of  M  takes  the  form 


N/Upf,  T,  r)9(kim,U)e 


i(kx+7ny—Qt) 


.4MT  -  |A| 


i/idA 

8X  \  dk 


|A|  &A 

2  dX 2  I  dk 2  / 


0i(kx+m.y—Qi) 


Likewise,  the  formal  expansion  of  L  takes  the  form 


L  A(X,T,T)4>(k,m.,U)e 


i(kx+my—Qt)  _ ^  i(kx+my— Qt) 


al*  -  | a  | 


1/2 


dA  .dl 


i-ty 


dX  \  dk 


+  I A  j  sgn(A)J4  — T  - 


SL  .  1  d2A  ( <92L 


dU  2  dX2  l  dk2 


To  simplify  notation,  let  L0  =  L(fcc,7r,  Uc)  and  M0  =  M(kC)  tt). 

Utilizing  these  expansions  and  collecting  terms  of  O  yields  the  leading  order 

problem 


M'(i) 

dx 


=  0, 

=  0,  for  y  =  0,1. 


(25) 

(26) 


This  is  the  linear  problem  (16)  at  the  marginal  shear  t/c.  There  are  infinitely  many  normal 
mode  solutions,  but  all  will  decay  at  long  times  T  =  0(1)  except  for  the  marginally  stable 
mode.  As  we  are  concerned  with  the  long  time  evolution  of  the  instability,  we  take  the 
leading  order  solution  to  be 

T(1)  =  Re  ^A(X.  T,  T)i> cei{kcX~nct)  sin  Try}  .  (27) 
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Collecting  terms  of  O  (|A|)  yields 


3^(2) 

dx 


0*W  +  i 


d  dM  d VM 
dt  dk  dX 


.did*™ 
ldk  dX 


0,  for  y  =  0,1. 


(28) 


The  operator  on  the  left  hand  side  of  this  equation  is  the  same  as  in  equation  (26).  Thus, 
inhomogeneous  terms  proportional  to  the  marginally  stable  mode  will  produce  a  secular 
response  in  \I/(2).  The  elimination  of  these  secular  terms  introduces  a  first  constraint  on  the 
evolution  of  A(X, T, r). 

Substituting  the  solution  (27)  into  equation  (28),  one  finds 


+Im{^c*M0#c}^|J4|2sin27ry.  (29) 


The  overbar  represents  complex  conjugation.  Since  equation  (29)  is  linear,  takes  the 
form 


$(2)  =  Re  |^(2)  (X,  T,  r)e  i(fccX~n<:t)  J  sin  Try  +  S(2)  (X,  T,  r)  sin  2 Try  +  U{2)  (X,  T,  r) 


The  first  term  represents  a  correction  proportional  to  the  marginally  stable  mode.  The 
second  two  terms  are  independent  of  x,  and  represent  an  0(|A|)  correction  to  the  zonal 
mean  flow.  Substituting  this  form  into  equation  (29)  and  collecting  terms  proportional  to 
sin  27 xy  yields  a  problem  for  B^: 

—  L(0,2t t,Uc)BW  =  -^-Im  {^*1^7}  N2 

Using 

L(0,  2tt,  Uc)  =  47r2rl 

gives 

5(2)  =  -^Im  {4-c  *  M0i)  l^l2  •  (30) 

Collecting  terms  proportional  to  sin7ry  yields  a  problem  for  A^: 


(l0  +  A® 


-  ^  dA 
M°*cdf  +  l 


dl  dM 

dk  +  *Qc  dk 


Vr 


dA 

dX 


Now,  as  demonstrated  by  equation  (17),  the  operator  on  the  left  hand  side  is  singular.  For 
the  equation  to  be  solvable,  the  right  hand  side  must  be  orthogonal  to  the  operator’s  left 
null  vector.  That  is,  for  44  such  that 

$j(u  +  *ncM0)  =  o, 
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we  must  have 


M 


i  dA  . 


dl  .n  dM 
die +  lQc~dk 


dA 

ox 


=  o. 


(31) 


This  equation  implies  something  about,  the  evolution  of  A.  Unfortunately,  it  is  a  mess  and 
difficult  to  interpret  physically,  especially  if  the  matrix  products  are  written  out  in  full.  It 
is  at  this  point  that  our  attention  to  the  formal  structure  of  the  expansion  becomes  useful. 
Note  that 


8 

dk 


(lo  +  ifl(k,Tt,UcJ 


*(k,n,Ue) 


-  0. 


If  we  expand  this  expression  with  the  product  rule  and  evaluate  at  k  —  kc,  we  find 


dL  AM  \  .  dU 

- h  lQr -  thr-  -  —l  - 

dk  cdk  °  dk 


Mo'I'c 


k=kc 


(lo  + 


d*_ 

~dk' 


That  is,  the  operator  splits  into  a  term  proportional  to  the  group  velocity  of  the  marginal 
wave  (sec  equation  (24))  and  a  term  that  is  orthogonal  to  the  left  null  vector  4-4  by  definition! 
Substituting  this  new  relation  into  equation  (31)  yields  a  more  familiar  evolution  equation 
for  A: 


dA 


+  Cg 


dA 

dX 


=  0. 


The  amplitude  envelope  propagates  at  the  group  velocity  of  the  marginally  stable  wave.  We 
now  write  A(X,T,t )  =  A(tj,t)  where  ?;  =  X  —  cgT.  We  also  have 


d  d  d  _  d 
dX  dr)  ’  dT  °9  di) 


It  is  still  necessary  to  solve  for  A^ .  One  solution  is 


A^  = 


.  d4>  dA 
*  dk  di) 


(32) 


One  might  add  a  homogeneous  term  proportional  to  vbc  to  this  solution,  but  the  inclusion 
of  such  a  term  has  no  impact  on  the  results  of  the  computation.  The  term  is  proportional 
to  ^(1),  so  we  may  simply  require  that  it  be  absorbed  into  the  leading  order  solution. 

Finally,  is  determined  by  enforcing  the  boundary  condition  (15).  It  can  be  shown 
[10]  that  the  normal  flow  condition  implies  a  constraint  on  the  zonally  averaged  flow  at  the 
boundaries, 

a-  1  rM  ;i‘», 

=  ,!™cc  2 TiJ_u-m  ’  "  ’ 

This  equation  implies  that,  as  higher  order  corrections  develop,  they  cannot  alter  the  mean 
zonal  flow  at  the  boundaries.  We  deduce  that, 

U{  2)  =  -27 tB{2) 


To  summarize,  we  have  found 

\['(2)  =  Me  |_7^r^j~et(Ac:r~nc<)|  sin7ry-^llm  {4>c*  M04'cj  |,4|2  ^sin 27ry  -  27r  (y  -  . 
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Collecting  terms  of  0  (|A|3/2^  yields 


dk  dt  dk  l  dX 


1U(2) 


d  .  d2  dM  Id2/  d2i  d  d2  M 
drM°~ldXdf  ~dk  +  2dX*  \dk2~dt~dB 


+Al?r(1) 


-J  (V2>,  M W)  -  J  («rW,  M0*®)  +  j  (V1*,  M4E<(1))  .  (33) 


where 


(*w)-K(a 


Since  our  only  purpose  in  proceeding  to  this  order  is  to  find  another  evolution  equation  for  A, 
it  is  not  necessary  to  solve  for  in  full.  We  simply  note  that  the  right  hand  side  of  equa¬ 
tion  (33)  contains  terms  proportional  to  e l(kcx-nct)  sjn 7^  sin27ry,  and  e%^cX^ct^  sinSny. 
Therefore,  since  the  problem  is  linear  we  may  assume 

=  Re  ^(3)  sin  7 ry  +  sin  37ry^  |  +  sin27ry  —  U^y. 

The  evolution  equation  we  seek  will  emerge  as  we  attempt  to  solve  for  .  Substituting 
in  the  solutions  for  and  and  equating  terms  proportional  to  sin7ry  yields 


where 


(Lo  +  *ocm)  A(3)  =  (mA) 


1  \f  d2i  _  d2 M 


^ -  (it\§j*c) A+Uiw +Il2AlA]2 '  (34) 


n-2  ^+^rc+2U+^m 


d L  .  n.dn  d 

—  +  l£lc-z-r  -7TT  +  — 


dkdk  (  °*c)  ’ 


3Im{$c*M0$c}*.(M0$c)  -  , 


where 


M  =  M  (0,  2tt)  + 


-2  F  2  F 
2  F  -2  F 


-4t r2  -  3F  3 F  \ 

3F  -4?r2  -  3F  J  ' 


Using  the  relationship 


one  can  show  that 


Likewise,  using 


q2 

~qJ^2  (C-o  +  ^(k,  Cc)  (fc,  7T,  Uc) J  =  0, 


1  (9^  XV  //v  XV  \ 

ni  =  -2i8FMA-(L»  +  s!5“M”)aF- 


(L0  +  iJ7(fc,7r,C/c)  4>(kc,ir,U)  =  0, 
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one  may  show  that 


dl 

dU 


=  (L0  +  ^cMo) 


cN> 

ou' 


Finally,  forming  the  solvability  condition  as  in  equation  (31)  gives 

dA  .  d2A  _lil|2 
llA  +  ~  \A\  * 


dr 

where  the  coefficients  are  given  by 


d  if 


/x  =  sgn(A)  u  =  -xsgn(A) 
®cM0®c 


dU 


k=kc 


v  — 


c  = 


1 .  d2ft 


2  dk? 


k=kc 


ik2  H  (31m  I'i'c*  M0^c}  *  (m04»c)  -  A4Im  |^c  *  MA|  *  ^c) 


8  r 


(35) 

(36) 

(37) 

(38) 


We  have  derived  a  complex  Ginzburg-Landau  equation  governing  the  onset  of  baroclinic 
instability  in  a  two  layer  model.  The  expressions  for  the  coefficients  are  expressed  in  terms 
of  quantities  computable  from  the  linear  theory  of  baroclinic  modes.  Specifically,  we  have 

*c=(1,7)T, 

where 
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7_  ~i F  F(Uc-c)  ~  l~kc  F  {Uc  —  c) 

where  c  =  Qc/kc.  A  useful  fact  about  7  is  that 
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The  left  null  vector  44  is  simply 


*4=  (r 


Ur. 


Ur+  C 


7 


After  some  algebra,  one  finds  the  following  explicit  expressions  for  the  coefficients: 
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where 
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4  Parameter  Regime  Analysis 


Having  computed  the  coefficients  of  equation  (35)  in  terms  of  physical  variables,  we  are  now 
prepared  to  determine  which  of  the  qualitative  dynamics  of  the  CGL  observed  numerically 
in  [1]  and  [12]  may  be  observable  in  baroclinic  instability. 

In  [1],  Chate  analyzes  the  CGL  in  the  form 


dB 

dt 


=  B  +  (l  +  ibi) 


d2£ 
dx 2 


(h-i)\B\2B. 
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where  &i,  63  are  real  and  63  >  0.  This  simple  form  is  obtained  from  equation  (35)  by  making 
the  transformations 
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from  which  we  find 
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Thus,  the  parameter  space  of  the  CGL  has  two  real  dimensions.1 

The  qualitatively  different  regimes  of  parameter  space  are  mapped  with  respect  to  b\ 
and  63  in  figure  3a.  This  figure  is  reproduced  from  [1].  To  discover  which  regimes  are 
relevant  to  baroclinic  instability,  bj  and  63  were  computed  numerically  for  a  range  of  /?,  r, 
and  F. 

It  was  found  that  61,  63  are  functions  of  F  and  (3/r  only,  though  this  fact  is  not  immedi¬ 
ately  obvious  from  the  formulas  for  (  and  v.  For  all  tested  values  of  F,  /3/r,  we  found  61  <  0. 
At  fixed  F,  decreasing  (3/r  increases  63  and  decreases  \bi\.  As  (3/r  is  varied,  the  coefficients 
roughly  satisfy  6163  =  C(F).  Decreasing  F  at  fixed  (3/r  also  increases  63  and  decreases  |&i|. 
In  the  limit  that  (3/r  — >  0  or  F  — >  7r2/2,  we  find  that  63  — ►  00  and  &i  — >  0.  2  In  this  limit, 

1  Technically,  this  form  can  be  obtained  from  equation  (35)  only  if  Em  {£}  <  0,  as  is  apparent  from  an 
inspection  of  the  transformations.  However,  if  Em  {£}  >  0,  taking  the  complex  conjugate  of  equation  (1)  and 
then  applying  the  transformations  yields  equation  (42)  for  B. 

2Recall  from  §3  that  for  F  <  7r2/2,  baroclinic  instability  is  impossible. 
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(a) 


Figure  3:  (a)  The  parameter  regime  diagram  of  the  CGL  produced  by  Chate,  Shraiman,  et 
ah  Reproduced  from  [1].  (b)  These  points  represent  the  only  points  for  5  <  F  <  50  and 
0.1  <  (3/r  <  50  that  fall  in  the  sector  of  parameter  space  studied  in  [1],  [12].  The  points 
correspond  to  /3/r  ~  10  —  40  and  fall  well  within  the  “no  chaos”  regime. 


the  coefficients  of  the  CGL  are  purely  real.  The  real  Ginzburg-Landau  equation,  unlike  its 
complex  cousin,  is  derivable  from  a  variational  principle,  and  its  solutions  always  relax  to 
a  stationary  equilibrium  state. 

The  values  of  bj  and  bj  for  5  <  F  <  50  and  0.1  <  /3/r  <  50  are  plotted  in  figure  4.  The 
lower  limits  of  these  ranges  were  chosen  to  exclude  divergent  values  of  b%  as  F  — »  7r2/2  «  4.93 
and  /3/r  — *  0.  An  upper  limit  of  50  was  chosen  for  both  F  and  /3/r  to  prevent  these 
parameters  from  being  much  more  than  an  order  of  magnitude  larger  than  one.  It  is  implicit 
in  the  derivation  of  equation  (35)  that  F, /3, r  |A|~1//2.  The  larger  these  parameters 
become,  the  smaller  |A|  must  be  for  equation  (35)  to  be  asymptotically  consistent. 

As  baroclinic  instability  resides  exclusively  in  the  region  b\  <  0,  we  focus  on  the  dy¬ 
namical  regimes  present  there.  For  bj  <  63,  a  band  of  stable  plane  wave  solutions3  of  the 
form 

B  =  5(fc)ei(fcc“w(*w 

exists  with  B 2  =  (l  —  /c2)  /&3  and  u  =  I/63  —  (b]  +  1  /b%)k2.  These  solutions  are  linearly 

3 When  hi  =  63,  this  band  of  wavenumbers  vanishes,  and  stable  monochromatic  plane  wave  solutions 
cease  to  exist.  This  bifurcation  is  known  as  the  Benjamin-Feir  instability  of  the  k  =  0  state.  The  turbulent 
regimes  in  the  Benjamin-Feir  unstable  region  61  >  63  are  the  subject  of  [12],  but  because  these  regimes 
appear  to  be  inaccessible  to  baroclinic  instability  in  the  Phillips  model,  we  omit  discussion  of  them  here. 
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Figure  4:  Values  of  b\  and  63  computed  for  a  range  of  values  of  /3/r  and  F.  Each  streak  of 
like-shaped  markers  corresponds  to  a  fixed  value  of  (3/r  with  F  =  5, 7, 9, 16,  50.  Increasing 
F  with  fixed  /3/r  decreases  63  and  increases  |&i|.  Increasing  /3/r  with  fixed  F  yields  the 
same  trend,  but  the  dependence  of  (3/r  is  stronger  than  on  F. 


stable  for 

fV  \  n 

353  +  bi  ^1 

However,  the  existence  of  stable  plane  wave  solutions  does  not  preclude  the  existence  of 
chaotic  solutions  or  localized  structures  in  the  same  parameter  regime,  Chate  found  that 
for  sufficiently  small  63  solutions  could  be  found  numerically  in  which  localized,  propagating 
structures  separate  large  regions  of  stable  plane  waves.  The  structures  are  characterized 
by  a  sharp  reduction  in  \A\,  and  discontinuities  or  rapid  variations  in  the  phase  of  A.  The 
structures  act  as  nucleation  sites  for  disorder;  the  stable  plane  wave  regions  do  not  break 
down  until  they  are  contaminated  by  one  of  these  structures.  The  nature  of  these  structures 
is  discussed  in  the  context  of  known  exact  solutions  of  the  CGL  at  some  length  in  [1]  and 

[13]- 

However,  it  is  not  yet  clear  whether  these  structures  can  be  expected  to  appear  in 
baroclinic  instability.  For  the  range  of  (3,  r,  and  F,  tested  here,  the  coefficients  bi,  and  63 


(63-61) 
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Figure  5:  (a)  Space-time  plot  of  \A\.  x  increases  left  to  right  and  t  increases  upwards.  White 
represents  the  amplitude  maximum  \A\  =  2.37,  and  black  represents  \A\  =  0. 

(b)  Space-time  plot  of  the  phase  </>  of  A.  Reproduced  from  [1].  In  this  run,  b\  =  —0.75, 
=  0.18. 

generally  lie  far  outside  the  region  of  parameter  space  explored  directly  by  Chate.  Figure 
3b  shows  those  parameter  values  that  did  lie  in  that  region,  and  all  of  those  are  well  within 
the  “no  chaos”  zone  in  which  intermittency  was  not  observed.  For  small  (3/r  and  F,  we 
have  seen  that  the  coefficients  asymptote  to  b\  —  0  and  63  =  0,  where  the  dynamics 
collapse  to  those  of  the  real  Ginzburg-Landau  equation.  It  is  unlikely  that  disordered  states 
will  be  discovered  in  this  limit,  since  solutions  of  the  real  Ginzburg-Landau  always  relax 
to  equilibrium.  But  as  (3/r  increases,  £>3  becomes  small.  This  raises  the  possibility  of 
intermittency  for  sufficiently  large  (3/r.  However,  |6]|  simultaneously  becomes  large  as  (3/r 
increases,  pushing  the  coefficients  out  of  the  sector  of  parameter  space  observed  by  Chate. 
Chate  found  that  as  \bj  |  increases,  63  must  be  ever  smaller  for  intermittency  to  be  observed. 
The  question,  then,  is  whether  63  decreases  quickly  enough  to  counteract  the  stabilizing 
effect  of  increased  \b]\.  Furthermore,  (3/r  cannot  be  increased  without  bound.  We  must 
have  |A|1//2  <C  r  absdel  — 1/2  and  (3  for  equation  (35)  to  be  accurate,  and  thus 

(3/r  |A|.  Numerical  simulations  of  CGL  at  parameter  values  appropriate  to  baroclinic 

instability  are  necessary  to  determine  if  “baroclinic  structures”  will  emerge  or  not. 

5  Conclusions 

We  have  computed  a  complex  Ginzbug-Landau  equation  for  baroclinic  instability  in  the 
Phillips  model.  We  have  compared  the  coefficient  of  this  equation  to  a  parameter  regime 
study  by  Chate  [1]  and  Shraiman  et.  al  [12].  The  comparison  suggests  that,  for  most  physical 
situations,  baroclinic  instability  should  saturate  to  a  monochromatic  wave  train  without 
intermittency  or  spatial  disorder. 

However,  the  search  for  baroclinic  structures  should  not  be  called  off  yet.  The  possibility 
remains  that  localized  structures  and  spatiotemporal  disorder  could  emerge  for  large  (3/r. 
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Further  numerical  simulation  of  the  CGL  is  necessary  to  determine  whether  this  will  happen, 
as  this  region  of  parameter  space  was  not  explored  in  Chate’s  paper.  In  future  work, 
we  intend  to  search  for  baroclinic  structures  in  numerical  solutions  of  both  the  CGL  and 
more  realistic  models  of  baroclinic  instability,  such  as  the  Phillips  model  or  a  continuously 
stratified  QG  model. 
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Figure  6:  In  both  these  figures,  waves  modes’  growth  rates  Em  {f2}  are  plotted  against  zonal 
wavenumber  k.  (a)  When  U  =  Uc  +  A,  zooming  in  around  k  =  kc  reveals  a  band  of  unstable 

wavenumbers  k  with  growth  rates  of  O(A).  The  unstable  band  has  width  O  ^Al1^2^. 

(b)  When  the  channel  has  finite  length,  only  a  discrete  spectrum  of  wavenumbers  k  are 
allowed.  In  this  case,  one  mode  may  become  unstable  alone,  while  all  others  remain  stable. 
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Intermittency  in  Some  Simple  Models  for  Turbulent  Transport 

Arghir  Dani  Zarnescu 


1  Introduction 

Consider  the  passive  scalar  equation 


Tt  +  (u  •  V)T  =  eAT  +  F,  T\t=0  =  <j> 


(1) 


where  T  is  a  quantity  which  is  passively  advected  by  a  fluid  with  velocity  u  and  F  is 
an  external  forcing.  The  quantity  T  can  be  for  instance  heat,  or  dye  used  in  visualizing 
turbulent  effects,  or  a  pollutant.  The  passive  term  refers  to  the  fact  that  the  effect  of  T 
on  the  fluid  is  negligible  so  that  one  can  regard  u,  the  velocity  of  the  fluid,  as  being  an 
externally  given  quantity,  which  does  not  depend  on  the  evolution  of  T. 

Although  (1)  is  a  linear  equation  for  T,  the  relation  between  the  passive  scalar  field  T 
and  the  velocity  field  u  is  nonlinear.  The  influence  of  the  velocity  field  on  the  statistics 
of  T  is  very  subtle  and  difficult  to  analyze  in  general.  For  instance,  the  interplay  between 
u  on  the  one  hand  and  F  and  </>  on  the  other  hand  may  lead  to  rare  but  large  amplitude 
fluctuations  of  T  (in  space,  time  or  both)  which  differ  considerably  from  the  average  and 
contribute  significantly  to  the  statistics. 

The  matter  of  interest  is  then  how  rare  these  large  fluctuations  are.  In  many  situations, 
based  on  the  Central  Limit  Theorem  as  a  heuristical  principle,  one  would  expect  things 
to  organize  themselves  so  that  in  the  average  the  distribution  of  the  variable  of  interest  is 
Gaussian.  But  large  fluctuations  can  be  more  frequent  than  what  is  required  for  the  Central 
Limit  theorem  to  apply,  and  then  fluctuations  can  dominate  the  statistics  in  a  non-Gaussian 
way.  This  phenomenon  is  referred  to  as  intermittency. 

One  signature  of  intermittency  is  the  presence  of  non-Gaussian  tails  for  the  probability 
distribution  function  (PDF  from  now  on)  of  T.  It  should  be  mentioned  that  there  are 
physical  experiments  where  such  a  behavior  has  been  observed  ( [1] ,  [2] ) . 

We  will  be  interested  in  identifying  flows  as  simple  as  possible  in  which  the  large  scale 
intermittency  appears.  Our  goal  is  thus  to  identify  some  of  the  simplest  mechanisms  capable 
of  producing  intermittency. 

While  the  flows  we  choose  are  simplistic,  these  models  can  provide  intuition  about  the 
phenomena  that  occur  in  real  turbulence,  and  with  these  specific  choices  the  calculations 
are  completely  rigorous  and  unambiguous.  This  is  the  path  followed  also  in  [5]  and  there 
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one  can  find  some  more  discussions  on  the  relevance  of  this  kind  of  approach.  Results  in  a 
similar  framework  can  be  found  in  [3],  [4], [6]. 

Our  choice  of  flows  will  fall  in  the  general  class  of  flows  proposed  by  M.  Avellaneda  and 
A.  Majda,  namely  flows  of  the  type 


which  can  be  regarded  as  nonlinear  two  dimensional  shear  velocity  fields. 


2  Heuristics 

We  will  start  by  offering  a  heuristical  interpretation  of  the  mechanism  of  intermittency  in  a 
general  setting  and  then  rigorously  prove  it  for  a  particular  choice  of  flow.  The  explanation 
for  the  decaying  case  (no  forcing)  has  already  been  given  in  [3]  and  it  is  included  here  for 
the  sake  of  completeness. 

In  the  decaying  case  we  have  the  following  representation  formula  for  the  solution  of  (1) 

T(x,t)=  f  <j>{y)g{t,x,y)dy  (2) 

Jr d 

where  git^x^y)  is  the  random  function  (for  fixed  x  and  t )  giving  the  probability  density 
function  of  X(t )  in  each  realization  of  u:  where  X(t)  is  the  solution  of  the  characteristic 
SDE  associated  to  (1)) 

dX(t)  =  u(X(t),  t)dt  +  V&d0(t),X{t  =  0)  =  x  (3) 

where  0{t)  is  a  Brownian  motion  accounting  for  molecular  diffusion.  In  terms  of  X(t),  (2) 
can  be  written  as 


T(x,t)=E^(X(t)) 

where  denotes  expectation  over  0{t)  conditional  on  X(t  =  0)  =  x. 

As  time  evolves  g(t,x:y)  broadens  and  assuming  that  <f)  has  mean  zero,  it  is  clear  from 
the  representation  formula  (2)  that  the  dynamics  will  smooth  out  any  spatial  fluctuations 
in  the  initial  data,  with  an  average  rate  depending  on  the  average  growth  rate  of  the  width 
of  g(t,x,y). 

On  the  other  hand  in  any  realization  where  g(t,x,y)  broadens  abnormally  slowly,  one 
will  observe  a  large  fluctuation  in  the  scalar  field  amplitude  at  point  x,  even  if  the  initial 
data  sampled  by  X(t)  is  very  typical  (see  Figure  1). 


The  situation  is  different  in  the  forced  case.  Then,  the  representation  formula  for  the 
solution  becomes 


T(x,t)=  [  (f)(y)g{t,x,y)dy  +  [  [  F(y)g{t  -  s,x,y)dyds  (4) 

jRd  Jo  JRd 
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T(x,t) 


g(t,y,z) 


Figure  1:  The  heuristics  in  the  decaying  case 


Equivalently,  the  above  formula  can  be  written  in  terms  of  averages  over  X(t)  ,  giving  us  a 
Lagrangian  picture  of  the  evolution  of  T 

T(x,  t)  =  E &(X(t))  +  f  E0xF(X(t  -  s))ds  (5) 

Jo 

In  general,  as  t  —►  oo  one  has  E x(f>(X(t))  =  (p(y)g(t:  x,  y)dy  —>  0  and  thus  one  is  left 

with  analyzing  the  effect  of  the  forcing  term 

/  E %F(X(t  -  s))ds  (6) 

Jo 

Generically  the  trajectories  of  the  X(t  —  s)  will  tend  not  to  be  on  the  level  curves  of  F,  and 
given  the  mixing  effect  of  the  flow  they  will  be  relatively  uniformly  spread  within  a  short 
time,  so  F(X(t  —  s))  will  average  to  a  zero  value.  On  the  other  hand  there  will  be  (rare!) 
realizations  of  u  where  the  effect  of  mixing  will  not  be  so  strong,  the  diffusion  will  be  the 
main  (slow)  mechanism  for  the  spreading  of  X (t),  which  will  happen  in  a  slow  time.  Thus, 
those  X (t  —  s )  which  start  on  a  level  curve  of  F  will  have  the  possibility  of  remaining  on 
the  level  curves  for  a  time  long  enough  so  that  the  average  of  F(X(t  —  s ))  will  be  equal  to 
a  nonzero  constant  (close  to  the  value  of  F  on  that  level  set). 


In  both  cases  rare  realizations  may  have  very  strong  effects  on  large  scales.  This  will 
prevent  the  type  of  averaging  which  leads,  by  the  Central  Limit  Theorem  to  a  Gaussian 
distribution  for  the  PDF  of  T  ,  and  indeed  one  will  observe  ”fat”  (non- Gaussian)  tails  for 
the  PDF,  consistent  with  large  scale  intermittency. 

In  our  approach  we  will  use  a  Lagrangian  picture  as  this  offers  a  simple  understanding 
of  the  phenomena  which  occur.  Indeed,  we  will  consider  the  associated  stochastic  differ¬ 
ential  equations  associated  to  the  passive  scalar  equation  and  we  will  use  them  to  obtain 


250 


representations  formula  for  T  from  which  we  will  compute  the  PDF  of  T.  Thus,  one  can 
see  the  appearance  of  intermittency  as  the  result  of  clustering  of  close  trajectories  in  the 
realizations  where  the  effect  of  turbulent  mixing  is  abnormally  weak. 


3  The  Decaying  Case 


We  will  take  the  flow  to  be 


(7) 


where  g  is  a  Gaussian  random  variable,  with  mean  zero  and  variance  one.  This  is  a  time  in¬ 
dependent,  "periodic  shear"  analogue  of  the  random  shear  model  of  A.Majda  (see  [5]), model 
in  which  the  sin(a:)  from  our  equation  is  just  x  and  g  is  time  dependent. 

In  this  case  (1)  reduces  to 


^  +  gsm(x)*E  =  eAT,  T|t=0  =  4>{y) 


(8) 


We  also  assume  that  the  initial  data  depends  only  on  y  and  it  is  a  mean  zero  Gaussian 
random  process,  statistically  independent  of  the  random  velocity  field,  and 


<f>(y)  =  [  eipy\/rE(p)d,W(p ) 

Jr 


(9) 


with  energy  spectrum 


e(P)  =  cE\k\am  (io) 

where  ip(k)  is  a  cutoff  function,  rapidly  decaying  for  |fc|  >  1,  ^(0)  =  1  and  satisfying 
The  quantity  Ce  is  a  normalizing  constant  and  dW  is  a  complex  white 
noise  process  (independent  of  5),  with 

<  dW(p),dW(q)  >=  S(p  —  q)dpdq 


The  exponent  a  >  —  1  in  formula  (10)  measures  the  decay  of  the  spatial  correlation  of  the 
initial  condition  < p(y)]  the  smaller  cn,  the  longer  the  spatial  correlation. 

In  the  case  where  there  is  no  forcing,  it  is  well  known  that  T  will  decay  to  zero.  Therefore, 
in  order  to  observe  the  intermittency  we  will  look  at  T  rescaled  by  the  energy  E  =  ET2 
which  we  shall  denote  by  0  =  ^=.  We  will  compute  in  the  following  the  PDF  of  0  in  the 
long  time  limit  and  we  will  obtain  that 


F(X  >  A)  «  C\ A  tt+a  as  A  — >  00  (11) 

where  C\  is  a  constant,  independent  of  time,  whose  value  can  be  computed  explicitly  (in 
general  in  the  following  Cq ,  C2 , . . .  will  be  used  to  denote  constants  which  can  be  explicitly 
computed,  and  are  independent  of  time). 

Consider  the  stochastic  differential  equations  associated  with  (8) 
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(12) 


X{t)  =  V2ed.(3x(t) 

Y(t)  =  gsin(X(t))dt  +  \/2 ed(3y(t) 


The  equations  (12)  have  the  solution 


X(t)  =  x+V2e(3x(t) 

Y(t)  =  y  +  gfo  sin(x+  \/2 e(3x(s))ds  +  \f2e(3y{t) 


Since  g  and  /?  are  independent,  we  have  then  that 


E pY (t)  =  y  +  g  /  sin(a:)e  tsds  =  y  +  -  sin(x)(l  —  e  ef) 

J  o  e 

~y+-  sin(x),  as  t  >  oo 


and 


(13) 


(14) 


E0(Y(t)  -E0(Y(t)))2  = 

=  E0g2  fo  Jq  sin(.T  +  %/2 !{3x(s))  sin(x  +  \/2c(3x{s'))dsds'  +  et  +  g2(fg  sin(a;)e~£sds)2 
=  ^  Jj  /o(e_£|f,_5'1  -  cos(2.x)e-e(s+s'+2mi"(s-s,)))dsds'  +  et  +  £  sin2(:r)(l  —  e  et)2 
=  «  +  </2({  +  1=*^)  -  +92sin2(x)!!!^i2  (15) 


Thus,  for  /  1  we  have 


E0(Y(t)-E0(Y(t)))2  ~et  +  g2 


(16) 


Next  observe  that,  using  (9),  we  have  an  explicit  representation  of  T  as 

T  =  E0cf>(Yt)  =  f  eipMt)-hp2v(t)  ^/E(pjdW{p)  (17) 

JR  2 

where  we  used  the  fact  that  (j)  is  a  function  of  only  one  variable;  m(t)  and  v(t)  are  respectively 
the  mean  and  variance  of  Y(t)  with  respect  to  the  Brownian  motion  (3  which,  taking  into 
account  (14)  and  (16),  for  large  t  become 


m(t) 

v(t) 


~  y  +  -  sin(x) 
2  t 

~  at  +  g  — 
e 


Introduce  the  rescaled  variable 

and  the  rescaled  white  noise 


(18) 

(19) 


(20) 


dW,(z)  i  v(t)l.dW(-i=) 

v«w 


where  =  stands  for  the  equality  in  the  sense  of  distributions.  In  terms  of  these  quantities 

(21) 


we  can  rewrite  the  representation  formula  of  T  as 

T  =  f  \/Ce\z\^iIj1/2  (— —=)  dWt(z) 

Jr  \  v  ) 

Therefore  (using  (18)) 


<t>  ~  \[Ce  [  e  2  |^r|  2 -01/2 ( —JL=)dWt(z)  — »  0  ast  — »  00 

Jm  V»(i) 

where  the  limit  here  and  below  is  understood  in  the  sense  of  distributions  and 


<f>=VCE  /  e~\z\2dWt(z) 

Jr 

This  implies  that  for  large  times  we  have 

(1+a)  - 

T(t,-)~v(t )  4  (f) 

Using  the  explicit  formula  for  T  we  can  compute  E(t)  —  E g,pT2  which  is 

Kg,fT2  =  CeE»K«)-H*  [  e-*2|2|“«^=)&] 

JR  VVt 


Prom  (18),  this  is 


.  .  d  _  (Ufa)  .  (1+q)  x 

jE/(t)  —  C/ 2^  2  “h  o(t  2  ) 


By  rescaling  T  we  will  obtain  a  finite  limit,  namely  let  us  consider  the  quantity: 

T 


6  = 


Ve 


Then  the  above  allow  us  to  conclude  that 


6(t,  •)  ~  6(t)  ast  —4  00 


(22) 

(23) 

(24) 

(25) 

(26) 

(27) 


where 


2 

0  =  C3(e+^)-1^^  (28) 

We  can  now  compute  the  tails  of  the  probability  distribution  of  6.  As  </>  is  normally 
distributed  with  mean  zero  and  variance  o  (which  can  be  explicitly  computed,  see  (22)), 
we  have  (assuming  without  loss  of  generality  that  G$  =  l,a  =  1  for  the  sake  of  simplifying 
the  computation): 

2 

<-)  2  roc  r  — — — 

P(bA)  =  F((e  +  ^-)-^>A)=  /  ^LdadPs  (29) 

e  Jo  J\a\S~^~>\  V27T 
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where 


2  o2 

Ps  =  F(e  +  -jy-  <  <5) 


.  e  2  & 


V 


2 


Integrating  by  parts  in  (29)  we  obtain: 


(30) 


F(6  >  A)  = 


(1  +  a)  A 
2y/2ir 


r2zl  _I\2i 

<5  4  e  2A  d 


l  +  Q 


(31) 


In  order  to  compute  the  integral,  for  A  >>  1  thanks  to  the  exponential  factor  and  to 
the  Laplace  method,  we  only  need  to  know  Ps  for  small  S.  Using  the  change  of  variables 

4 

S  =  sX  by  standard,  though  tedious,  computations  we  get  (11)  to  the  leading  order  in 
A  (as  A  — >  oo). 


4  The  Forced  Case 

4.1  The  one  mode,  time  independent,  stirring 

We  will  consider  the  flow  to  be  given  by: 

u  =  (  .  .  9  .  )  (32) 

\sm(x  +  (p)J 

where  g  is  a  Gaussian  random  variable  which  has  mean  zero  and  variance  one  and  ip  is  a 
random  variable  uniformly  distributed  on  [0, 27r].  The  two  random  variables  are  indepen¬ 
dent. 

The  passive  scalar  equation  becomes 

Tt  +  gTx  +  sin  (a:  +  (p)Ty  =  eA T  (33) 

Assume  also  that  the  mean  gradient  of  T  is  imposed 

T  =  j  +  T  (34) 

Then  T  will  satisfy  the  equation 

ft  +  gfx  +  sin(.r  +  <p)Ty  +  s’n(x  +  V3)  _  j1  (35) 

Jj 

In  this  specific  case,  the  general  heuristics  from  the  second  section  can  be  made  more 
precise  and  we  have  a  simpler  mechanism  which  is  responsible  for  intermittency  and  can  be 
understood  as  follows. 

Let  us  assume  that  T  represents  the  temperature  and  we  have  a  region  made  of  two 
parts,  one  hot  ((x,  y )  E  R2  with  x  >  0)  and  one  cold  ((x,  y )  E  R2  with  x  <  0).  In  the  generic 
case,  when  g  ^  0  we  will  have  transport  in  both  x  and  y  directions,  and  thus  mixing  of  the 
cold  and  hot  which  will  lead  to  a  decrease  in  the  average  temperature.  In  the  realizations 
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when  g  «  0,  however,  the  flow  u  points  only  in  the  y  direction,  so  (neglecting  the  effect 
of  diffusion)  there  is  transport  only  in  the  y  direction.  The  hot  region  remains  hot,  and 
the  cold  one  cold.  The  extreme  values  of  temperature  will  not  be  significantly  changed. 
Therefore  one  expects  that  the  rare  realizations  where  g  &  0  will  strongly  influence  the 
average  over  all  the  realizations  leading  to  a  non-Gaussian  distribution  of  T.  Indeed,  we 
will  obtain  that  the  tails  of  the  PDF  of  T  decay  like  A~2. 

In  order  to  make  the  above  reasoning  rigorous  let  us  consider  the  stochastic  differential 
equations  associated  to  (33) 


j  dX(t)  =  gdt  +  \f2 edj3x(t):  Xq  =  x 

\dY(t)  =  sin  (X(t)  +  <f>)dt  +  V^dpy(t),  Y0  =  y 

which  have  the  solution 


f  X  (t)  —  x  +  gt  +  y/2e(3x  ( t ) 

| Y(t)  =  y  Jq  sin  (a;  +  <f>  +  gs  +  y/2efdx  (s))ds  +  \/2ef3y(t) 

Assuming  that  the  initial  data  is  zero  (if  not  it  can  be  shown  it  decays)  by  Feynman-Kac 
formula  we  get  the  following  representation  of  the  solution 

rt  i 

T=-  Ep  j  sin(rc  +  <j>  +  gs  +  V2e/3x(s))ds  (36) 

Jo  T 

Thus  in  each  realization  we  have  that 


lim  T(x,y,t)  —  T(x:t)  =  0  (37) 

t— >oo 

where  T  is 


,fv_N  1  [g  cos(x  +  4>)-e  sin(x  +  0)] 

T(x)  =  I - WT? -  (38) 

We  are  interested  now  in  computing 

/(A)  =  P(f  (x)  >  A)  (39) 

To  this  extent, taking  into  account  the  independence  of  <p  and  g.  we  will  compute  first 
the  moments  only  with  respect  to  the  uniformly  distributed  random  variable  (j).  Indeed,  we 
have 


^Tln  =  [L(e2  +  gz)\ 


1  /'2ir 

—  /  ( 
2Wo 


(gcos(4>)  —  e  sm(<p))2n  dcf> 


(40) 
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One  can  compute  the  last  integral,  namely 


/•27T 

/  (gcos(cj))  —  esin(0))2" 
JO 


r  27r 


nd(f>  = 


r27r 


1 

22” 

2*  Jo  Sm=° 

1  /2n\,  2  , 


^2  n 


I  [(g  +  ei)eix  +  (g  —  ei)e  tx]2n dx 
o 

(g  +  ei)2n~m(g  -  ei)m eix^n-m~mUx 

*-\n)V+*'  (41) 


So 


E^,e 


jfcT 


Soo  ’ 

71=0 


nn  h.2n 

'lj  A'  -E^T2” 


(2n) 


S?=o(-l)”(^)2n  1  ''2  1  -2'-" 


K2LJ  (n!) 2 


[e‘  +  9‘ 


Jo 


- k 2 


L2(e2  +  g2)J 

(where  Jo  is  the  Bessel  function  of  the  first  kind)  and  thus 


m  =  / 

•/R 


_2_ 
e  2 


Jo 


-/c2 


sfa  "  [V  L2(e2  +  g2) 


dg 


(42) 


(43) 


Expressing  /(A)  in  terms  of  its  inverse  Fourier  transform  and  using  the  fact  that  /(A) 
is  real  valued  we  have 


i  r  „  i  r00 

/(A)  =  8?—  /  f(k)elkXd,k  =  3?-  /  f(k)cos(kX)dk 
2n  lia 


k  Jo 


=  JR 


K 


1 


+  .s5R 


v/2ttV2  ^ 

_L_ 

1 


-«2/2J( 


-fc2 


01  V  iV  +  s2) 


-fl2/2 


A?-t2 


Jo 


—  k2 


L2(e2  +  g2) 


dg  )  cos(kX)dk 

cos(kX)dkd,g 


^3/2  •W[-V/  z^-‘aVi^-‘2] 


,-»2/2 


f  J°(/ 


-fc2 


£2(e2  +  <?2) 


cos(kX)dkdg 


wliere  for  the  second  we  used  the  fact  that  the  Bessel  function  of  the  first  kind  is  an  even 
function;  also  for  the  fourth  equality  we  used  Fubini  to  interchange  the  order  of  integration. 
Recall  that 


Jo 


-k2 


Jo  KyL2(e2  +  g2) 

which  is  a  real  number  if 


cos(kX)d,k  = 


e2  +g2 


1  -  L2c2X2  -  L2X2g2 


(44) 


256 


<?e  [- 


1 


L2A2 


—  e* 


1 


L2A2 


<2] 


(45) 


and  purely  imaginary  (i.e.  with  zero  real  part)  otherwise.  Using  this  observation  and 
combining  the  last  two  relations  with  get: 


/(A) 


7T3/2 


V2: 


S2/2 


e2  +  g1 


1  -  L2e2 A2  -  L2A2p2 


dp 


(46) 


which  clearly  holds  if  and  only  if  A  <  ^.  On  the  other  hand,  taking  into  account  the 
definition  of  T{x )  and  of  /(A)  it  is  easy  to  see  that  for  A  >  ^  we  will  have  /(A)  =  0  and 
thus  /(A)  is  a  function  with  bounded  support. 

It  follows  that 


lim/(A)  =  f 

€->0  7T2  JO 


-si 

e  2  g 


which  asymptotically,  in  the  limit  A 


Vi  -  £2AV 


7rL|A| 

oo  behaves  like 

lim/(A)  «  ^?A~2 


e— >0 


(47) 


(48) 
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4.2  The  Gaussian  multimode  forcing 

In  this  section  we  consider  the  flow 


u 


a,) 


(49) 


where  g  is  a  Gaussian  random  variable  which  has  mean  0  and  variance  1,  and  v{x)  is  a 
Gaussian  process  specified  by 


v(x)  =  [  dW(k)y/E{k)t 

Jr 


ikx 


(50) 


(we  will  need  to  assume  that  the  function  E(k)  is  compactly  supported  away  from  0  and 
also  that  fR  --^—dk  <  oo). 

In  this  case  we  obtain  a  similar  behavior  as  before,  though  the  ingredients  are  quite 
different.  Namely  we  will  obtain  that  the  tails  of  the  PDF  of  T  will  decay  like  A-2. 

Indeed,  arguing  analogously  as  before  we  will  obtain  the  solution  will  evolve  as  t  — >  oo 
to  the  solution  of  stationary  equation 


0  —  —  gTx  +  eTx 


v{x) 


(51) 


which  will  give  us  the  representation  formula  for  he  solution: 
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T(x) 


1 

L 

L 


dW{k)^WX)eik(x-9t)-tkH 


dW(k)y/E{k) 


Jkx 


ek 2  + ikg 


(52) 


This  is  a  Gaussian  random  variable  (as  a  superposition  of  Gaussians  )  whose  moments 
with  respect  to  W  are: 


Ew(T(x))2n 


(2w)! 

2  nn\ 


(E  wT\x))n 


2n_  f  E{k)dk 

2»nl\JRk*(ek?  +  g*)' 


Hence 


which  implies 


E  wcikT 


^71=0 


■lnk2n 

(2n)! 


(2  «)! 

2nn! 


F(e,g)n 


-  e-|fc2F(£,fl) 


(53) 


(54) 


/C(A)  =  ^~  f  f  e-92/2e-^'9)eikXdkdg  (55) 

2tt  Jr 

Since  F(e,g)  — >  (when  6  — >  0,  with  M  =  /K  E-jpd-  <  oo)  and  by  Lebesgue’s  dom¬ 
inated  convergence  theorem  if  follows  that  /C(A)— »/(A)  where  (assuming  without  loss  of 
generality  for  the  sake  of  computational  simplicity  that  M  —  1) 
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1  Introduction 

We  begin  by  outlining  the  two  pieces  of  observational  evidence  that  motivate  this  study. 


1.1  Anomalous  diffusion 


The  density  of  the  abyssal  ocean  decreases  by  almost  3%  from  the  ocean  bed  to  the  seat  of 
the  thermocline.  Even  when  compressibility  effects,  such  as  would  exist  in  any  hydrostat¬ 
ically  balanced  fluid  body,  are  accounted  for  there  remains  a  significant  potential  density 
variation  with  depth,  representing  unequal  distributions  of  temperature  and  salinity.  The 
maintenance  of  this  density  stratification  must  be  understood  as  a  dynamic  process.  In  nar¬ 
row  regions  of  the  ocean  at  high  latitudes,  fluid  near  the  ocean  surface  is  cooled  sufficiently 
that  it  becomes  denser  than  the  fluid  that  supports  it,  and  sinks  to  great  depths,  mixing 
with  entrained  fluid.  Accordingly,  latitudinal  sections  of  the  potential  density  in  the  ocean 
often  depict  cold  dense  intrusions  of  fluid  from  the  Antarctic  or  Arctic  Oceans,  see  the 
Figure  1.  It  is  estimated  that  in  the  Pacific  Ocean  such  cold  intrusions  supply  fluid  to  the 
lowest  kilometre  of  ocean  at  a  rate  of  25-30  xl09kgs~*  [16].  There  must  be  a  corresponding 
upwelling  of  fluid  in  the  mid-ocean  or  else  the  centre  of  mass  of  the  fluid  system  would 
be  lowered  with  time.  Such  an  upwelling  would  annihilate  any  variation  in  density,  unless 
thwarted  by  downward  diffusion.  Specifically,  if  we  write  w  for  the  mid-ocean  upwelling 
velocity,  we  can  construct  an  approximate  balance  for  the  two  rates  of  density  transport  in 
the  vertical  direction  z1: 


w 


dp  d2p 


dz 


dz 


(i) 


Balance  in  this  equation  leads  to  equilibrium  distributions  with  a  vertical  scale  height 
H  =  k/w ,  and  with  w  ~  10~7ms”]  and  H  ~  1km  known  for  the  abyssal  ocean,  we  may 
use  this  formula  to  estimate  the  effective  diffusion  of  density:  k  ^  10^4m2s~1  [16].  This 
greatly  exceeds  the  molecular  diffusivities  of  salt  and  heat  (on  the  order  of  10~9m2s":1  and 


1 A  great  deal  of  coarse-graining  of  horizont  al  effects,  and  averaging  out  of  vertical  variations  must  take 
place  before  this  one  dimensional  equation  may  be  arrived  at,  but  the  inferences  drawn  from  it  are  supported 
by  more  nuanced  calculations.  In  particular,  the  balance  may  immediately  be  put  on  a  more  firm  footing 
by  interpreting  2  as  a  local  diapycnal  coordinate  -  that  is  relating  the  isopvcnal  surface- normal  components 
of  the  velocity  field  and  diffusive  flux  [15]. 
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Figure  1:  Potential  density  (a)  distribution  in  a  section  of  the  Pacific  Ocean  stretching  from 
Antarctica  (south)  to  the  Aleutians  (north).  Isosurfaces  are  labelled  by  the  (a  —  1000)  x 
103/kgm~3,  in  increments  ranging  from  0.20  in  the  thermocline  to  0.02  in  the  ocean  abyss. 
The  black  corrugations  are  the  ocean  bed.  (Figure  taken  from  [16]). 


10_7m2s'"1  respectively.  This  disparity  gives  strong  evidence  for  the  dominance  of  other 
dynamical  mixing  processes  over  molecular  diffusion.  In  the  past  decade  strong  observa¬ 
tional  evidence  has  emerged  for  enhanced  eddy  diffusion  localised  within  layers  of  ocean 
hundreds  of  metres  thick  above  rough  or  steepening  regions  of  the  ocean  floor  [7,  11,  13]. 
These  mixing  zones  extend  far  beyond  the  turbulent  boundary  layer  of  ocean  in  immediate 
contact  with  the  bed,  signalling  that  the  anomalous  diffusion  is  a  non-local  effect:  that 
fluid  driven  over  the  ocean  floor  in  tidal  flow,  mesoscale  eddies  or  else  wave-driven  currents 
generates  internal  waves  which  break  at  some  distance  from  the  ocean  floor  and  in  so  doing 
mix  up  the  local  density  field. 


1.2  A  universal  spectrum  of  internal  waves 

The  oceans  are  never  silent,  but  resound  with  internal  inertio-gravity  waves  at  all  length- 
scales.  Compared  to  the  energies  and  velocities  associated  with  the  ever-present  wave  field, 
the  currents  that  are  conventionally  thought  to  control  the  global  transport  of  temperature 
and  salinity  are  in  many  places  rather  feeble.  Studies  by  Garrett  and  Munk  in  the  1970s, 
culminating  in  [4],  showed  that  data  collected  from  moored,  towed  and  dropped  sensor 
studies  of  the  spectrum  of  waves  within  the  ocean  can  be  united  into  a  single  common 
spectrum.  There  are  various  equivalent  ways  for  casting  the  spectrum  (see  Section  2),  but 
one  common  form  is  in  terms  of  the  horizontal  and  vertical  wavenumbers  m  and  kn' 


F(h  ,  _ 3  fNE*m/m* _ 

7r(l  +  +  f2m2) 


(2) 
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Figure  2:  Energy  spectrum  in  (/c//,  m)  space.  Transects  represent  permitted  observational 
probes  of  the  spectrum  by  towed  horizontal  correlation  (THC)  and  dropped  vertical  corre¬ 
lation  (DVC)  detecting  instruments.  The  cleavage  plane  kj-j  =  m(l  —  /*2 /TV2)1/2  does  not 
represent  a  physical  cut-off,  but  is  intended  to  clarify  the  plot. 


where  N  is  the  buoyancy  frequency,  and  /  the  frequency  of  purely  inertial  waves,  and 
the  significance  of  these  two  parameters  will  be  discussed  in  Section  2.  The  wave-field  is 
isotropic  in  any  horizontal  plane,  so  only  a  single  horizontal  wavenumber  enters  the  relation. 
The  spectrum  is  graphically  shown,  including  various  distinguished  limits  of  small  or  large 
wavenumber,  in  Figure  2  drawn  from  [4].  The  variation  of  the  dimensional  spectral  density 
E*  and  the  bandwidth  m*  with  /  (viz  latitude)  and  N  have  been  obtained  theoretically 
and  validated  by  observation  [6]. 

The  existence  of  a  universal  spectrum  has  become  a  dogma  of  oceanography,  allowing 
the  rate  of  mixing  on  centimetre  scales  to  be  backed  out  from  the  amount  of  energy  in  wave¬ 
lengths  of  tens  or  hundreds  of  metres,  scales  which  are  much  easier  to  probe  experimentally. 
A  raft  of  interlocking  assumptions  takes  us  from  the  measurements  of  long  wavelength 
modes  that  can  be  relatively  easily  performed  (using  for  instance  acoustic  Doppler  profil¬ 
ing)  to  the  small  scale  dynamics  of  interest.  The  rate  of  turbulent  dissipation  equals  the 
rate  at  which  energy  is  supplied  from  to  the  mixing  scales,  and  this  can  be  computed  using 
a  semi-empirical  formula  (see  footnote  13  in  [11])  comparing  the  mean-square  shear  rate, 
latitude  and  buoyancy  frequency  to  the  open  ocean  Garrett-Munk  spectrum  at  a  reference 
latitude  of  30°.  The  rate  of  turbulent  dissipation  stands  as  a  proxy  for  the  dissipation  of 
available  potential  energy.  Total  turbulent  dissipation  (e)  is  assumed  to  exceed  dissipation 
of  available  potential  energy  by  some  factor  between  three  and  five.  Thus  the  effective 
diffusivity  k  =  TeN~2,  with  T  taken  to  be  between  0.2  and  0.3.  Various  links  in  this  chain 
of  inferences  have  already  been  scrutinised  theoretically  (see  e.g.  the  discussion  of  the  use 
of  a  constant  value  of  F  in  [10]). 
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1.3  Outline:  A  new  weakly  nonlinear  model 

In  reviewing  the  observation  evidence  for  a  universal  spectrum  of  internal  waves,  and  the 
detailed  use  of  this  spectrum  to  estimate  mixing  rates,  three  questions  should  immediately 
be  apparent  to  the  reader.  How  can  an  equilibrium  spectrum  arise  as  an  equilibrium  state 
of  interacting  waves?  How  do  the  waves  interact?  How  is  the  rate  of  mixing  related 
to  the  energy  present  in  the  wave-field?  There  have  been  several  attempts  to  reproduce 
limits  of  the  Garrett-Munk  spectrum  from  theoretical  arguments,  in  particular  a  recent 
result  by  Lvov  et  al.  [9]  showing  that  the  high  frequency  and  short  wavelength  part  of  the 
spectrum  is  consistent  with  a  model  of  resonant  triad  interactions  between  waves  of  different 
wavenumbers.  In  this  report  we  propose  a  weakly  nonlinear  theory  in  which  interactions 
between  waves  are  viewed  as  spatially  and  temporally  isolated  but  highly  non-linear  events, 
producing  well-mixed  zones  of  fluid.  Between  interactions  waves  evolve  according  to  the 
linearised  equations  of  motion.  The  use  of  linearised  equations  of  motion  to  describe  the 
collapse  of  well-mixed  zones  in  stratified  fluids  is  known  to  result  in  not  too  severe  errors 
[1]  in  predictions  of  the  density  and  velocity  fields. 

2  Evolving  the  linear  modes 

We  restrict  to  two  dimensional  disturbances  with  uniform  background  stratification  and 
no  associated  mean  flow.  We  apply  the  Boussinesq  approximation  that  variation  in  the 
fluid  density  occurs  on  a  much  longer  length-scale  than  variation  in  any  of  the  perturbation 
velocity  fields,  in  which  case  it  may  be  shown  (see  §6.4  of  [5])  that  the  density  may  be  taken 
to  be  constant  in  any  evaluation  of  the  rate  of  change  of  fluid  momentum,  and  density 
variation  admitted  only  when  buoyancy  forces  are  computed.  The  density  field  is  then 
decomposed  into  three  separate  components:  p  =  p0  +  p(z)  +  Sp(x,  2,  t),  and  a  hydrostatic 
component  is  subtracted  off  the  pressure  field  to  balance  the  background  stratification  po+p- 
Following  §8.4  of  [5],  we  may  write  down  a  triple  of  equations  representing  linearised 
momentum  balance: 


r  Px  .  771 

ut-  fv  = - h  F 

Po 

(3a) 

vt  +  fu  =  0 

(3b) 

wt  =  b  —  —  +  H  , 

(3c) 

Po 

where  (F,  0,  H )  are  the  components  of  a  specific  body  force  that  sets  the  fluid  into  motion, 
(n,  v,  w)  is  the  disturbance  velocity  field,  and  b  =  —  ^  the  buoyancy  field.  The  dynamical 
effect  of  the  rotating  frame  has  been  trammelled  up  into  the  pair  of  Coriolis  force  terms  on 
the  right-hand  side  of  (3),  in  which  we  have  followed  convention  by  defining  a  parameter  /  = 
2Q:  centrifugal  terms  are  assumed  to  have  been  assimilated  into  a  redefined  gravitational 
acceleration  g. 

Mass  continuity  then  takes  linearised  form: 

h  +  N2w  =  0  ,  (4) 

while,  consistent  with  the  Boussinesq  approximation,  we  may  assume  that  fluid  parcels 
neither  gain  nor  lose  mass  as  they  are  advected  by  the  fluid,  giving  rise  to  the  standard 
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incompressibility  relation: 


ux  +  wz  =  0  . 


(5) 


We  make  immediate  use  of  the  incompressibility  equation  by  defining  a  streamfunction 
such  that  u  —  w  =  —  rfx,  allowing  us  to  reduce  the  number  of  time-evolution  equations 
by  one.  Specifically,  subtracting  the  x-derivative  of  (3c)  from  the  ^-derivative  of  (3a)  gives 
the  vorticity  equation: 

VVt  -  fvz  -  -bx  +  r  ,  (6) 

where  r  =  Fz  —  Gx  is  the  specific  torque  associated  with  the  body  force  posited  above.  We 
define  a  potential  temperature  for  the  fluid: 


0  = 


gjp  -  po) 
Po 


=  N2z  +  b 


(7) 


so  that  the  disturbed  fluid  is  stably  or  unstably  stratified  according  to  whether  0. 

We  scale  lengths  by  the  horizontal  and  vertical  dimensions  of  our  experimental  box 
(LX)  Lz),  defining  dimensionless  hatted  variables  £  =  Lzz)  x  =  Lxx .  We  also  define  an 
aspect  ratio  e  =  Lz/Lx ,  which  we  anticipate  being  small.  It  is  natural  to  scale  the  buoyancy 
field  using  the  background  potential  temperature:  b  —  N2Lzb ,  and  the  time  by  the  buoy¬ 
ancy  period  t  =  t/N ,  which  will  turn  out  to  be  the  minimum  period  of  any  of  the  linear 
inertia, -gravity  wave  modes  of  the  fluid  body.  Scalings  for  the  other  dynamical  variables 
follow  from  selecting  dominant  balances  between  pairs  of  terms  in  the  vorticity  and  mass 
conservation  equations.  Balancing  the  rate  of  increase  of  vorticity  tyzzt  with  the  rate  of 
baroclinic  generation  suggests  a  scaling  for  the  velocity  fields:  -0  —  and  we 

scale  v  to  balance  the  ^-component  of  the  fluid  acceleration  v%  with  the  Coriolis  force  fu : 
v  —  e*NLz.  To  ensure  that  the  specific  torque  term  participates  in  the  dominant  balance 
of  terms  in  the  vorticity  equation  set  r  —  eN^t.  We  drop  the  hat  decorations  straightaway, 
and  present  in  dimensionless  form  our  remaining  governing  equations: 

(^2  q2  \ 

(2d^  +  d?)^t~ePTVz  +  bx  =  T  (8a) 

evt  +  Prx/jz  =  0  (8b) 

h  ~  e2ipx  =  0  .  (8c) 


Here  Pr  =  f  /N  is  sometimes  called  the  Prandtl  ratio ,  and  encodes  the  relative  strengths 
of  buoyancy  to  inertial  forces.  It  is  natural  to  take  e  =  Pr,  i.e.  to  consider  a  box  with 
aspect  ratio  dictated  by  the  balance  between  rotational  stiffness  in  the  horizontal  direction 
(the  tendency  of  fluid  to  move  in  Taylor  columns),  and  stratification  stiffness  in  the  vertical 
(resisting  any  lifting  of  isopycnal  surfaces).  A  typical  deep  ocean  value  of  the  Prandtl  ratio 
is  Pr  =  0.1. 


2.1  Unforced  modes 

With  F  and  H  set  to  zero,  the  fluid  body  supports  free  inertiogravity  waves.  It  suffices 
to  consider  the  evolution  of  plane-wave  disturbances,  with  well  defined  wavenumber  k  — 
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(&,  0,  ra).  The  time  evolution  of  the  (V>,  v,  b)  fields  must  satisfy  the  triplet  of  equations: 

—  (e2k2  +  m2)xpt  —  iePv  mv  +  ikb  =  0  (9a) 

cvt  +  imPv'ifi  —  0  (9b) 

bt  —  ie^kip  .  (9c) 

Since  there  is  no  explicit  time  dependence  in  these  equations,  we  are  allowed  to  seek  solutions 
with  monochromatic  time  dependence:  (?/>,  v ,  b )  a  for  some  eigenfrequency  u. 

Determination  of  the  eigenvalues  and  eigenvectors  of  the  associated  linear  operator  reveals 
the  existence  of  three  unforced  modes  of  the  body: 

(i)  A  geostrophically  balanced  steady  mode,  with  uj  =  0.  The  pressure  field  is  hydrostatic 
Pz/po  =  b ,  and  the  y-component  of  the  velocity  field  fixed  by  the  Coriolis-buoyancy 
balance  in  (3a): 

'ipb  =  0  and  kbb  =  ePrmVb  .  (10) 

(ii),(iii)  Two  propagating  modes  with  frequencies 

,IN_,  /Pr2  m2  +  e2/c2  ,llN 

w±(fc)-±y  m2  +  e2fc2  >  (n) 

with  wave-components 

Prm^±  =  ±eu>±v±  and  Pr  mb±  =  — e3/cui  .  (12) 


With  a  little  algebra,  we  see  that  an  arbitrary  initial  disturbance  may  be  decomposed 
into  balanced  and  propagating  wave  fields  as: 


=  0  bb 


Pr2  m2b  +  e3fc  Pr  mv  k  Pr  mb  +  e3/c2u 

Pr2  m2  +  fc2e2  6  e  (Pr2  m2  +  e2/c2)  ’ 


(13) 


with 


and 


xp± 


=K 


'ip  =F 


cakb  —  ecu  Pr  m,v 
Pr2  m2  +  e2k 2 


)  i,±=5ke2( 


ip 

T-  + 
o; 


kb  —  ePrmu  \ 
Pr2  m2  +  e2/c2/ 


2e 


± 


Pr  mip  k  Pr  mb  —  6  Pr2  m2u 


cu 


Pr2  m2  +  e2k 2 


(14) 

(15) 


where  we  have  identified  u;  =  o;+,  and  note  that  such  modes  can  then  be  evolved  with  time 
analytically.  Note  that  the  above  expressions  are  singular  if  both  k  and  m  vanish:  this 
corresponds  to  static  raising  or  lowering  of  the  entire  body  of  fluid. 


2.2  Forced  modes 


Grave  modes  of  the  system  are  forced  by  some  external  agency.  To  avoid  imparting  un¬ 
wanted  spatial  or  temporal  structure  to  the  disturbance  thereby  set  up,  we  assume  white 
noise  forcing.  Each  mode  may  therefore  include  a  forced  component: 


d_ 

dt 


ie  Pr  m 
m2-be2/c2 

o 

o 


o 

ie2k 

tPrm 


ik 


m2+e2k 2 

o 

o 


(16) 
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where,  by  appropriate  choice  of  the  unforced  component,  it  suffices  to  consider  the  initial 
conditions  xjj  =  b  —  v  =  0  at  t  =  0.  Now,  it  can  easily  be  seen  that  the  form  of  the  forcing 
is  such  as  to  never  excite  the  geostrophically  balanced  mode,  so  that  any  forced  mode  can 
be  instantaneously  decomposed  into  contributions  from  the  two  propagating  eigenmodes. 
Supposing  that  we  have  not  chosen  to  force  a  mode  with  vanishing  vertical  wavenumber 
(although  the  extension  to  such  purely  buoyancy  driven  waves  is  trivial)  it  is  convenient  to 
chart  the  evolution  of  these  two  modes  via  the  ^-amplitudes: 


v±(t)  =  T 


Pr  m 


2euj(e2k2  +  m2)  J0 


3±iu>(s  ? 


(17) 


where  for  white  noise  forcing  the  integration  measure  may  be  written  as  r(s)  ds  =  To(dVPis  + 
idW2s),  where  tq  is  some  constant  representing  the  strength  of  the  forcing,  and  Wis ,  W2S 
are  independent  Wiener-processes.  We  therefore  see  that  u±(i)  are  both  (complex- valued) 
Gaussian  random  variables,  with  easily  computable  mean  and  expectations.  A  little  algebra 
then  gives  the  evolution  of  the  forced  modes: 


v(k,t)  =  X(k,t)  ,  6(/c,i)  = 


X r{k,t)  and  ^(fc,<)=  ^  Y(k,t )  ,  (18) 


Pr  777 


where  X  and  Y  are  complex  Gaussian  random  variables  with  covariance  matrix: 
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The  stochastic  evolution  of  the  forced  linear  modes  can  therefore  also  be  performed  ana¬ 
lytically.  It  will  be  necessary  in  diagnosing  the  proximity  of  the  system  to  an  equilibrium 
state  to  relate  the  rate  at  which  energy  is  supplied  to  the  system  by  white-noise  forcing,  to 
the  rate  of  dissipation  in  breaking  events.  As  an  intermediate  step  to  doing  this,  it  is  useful 
to  write  down  an  energy  balance  the  distribution  of  energy  between  system  modes: 


\jt  £  (IWI2  +  «4K*)I2  +  <V*2  +  m2mk)\2)  =  f2  E  r(k)^(-k)  (20) 

k  k 


the  first  term  on  the  left  hand  side  gives  the  available  potential  energy  of  the  system  (the 
amount  of  energy  that  would  be  liberated  if  the  preexisting  stratification  were  restored),  and 
the  remaining  two  terms  the  kinetic  energy  for  out-of-plane  and  in-plane  motion  respectively. 
For  freely  propagating  disturbances  it  may  be  shown  that  energy  is  equipartitioned  between 
the  first  pair  of  terms  and  the  third. 


3  Wave-breaking 

Large  amplitude  wave- disturbances  are  vulnerable  to  both  shear  and  Rayleigh-Taylor  insta¬ 
bilities.  In  general  these  two  mechanisms  act  together.  Many  experimental,  numerical  and 
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theoretical  studies  have  addressed  the  cascade  of  instabilities  in  a  linearly  stratified  shear 
layer.  It  is  known  that  for  simple  shear  flows,  shear  instabilities  set  in  only  if  the  Richard¬ 
son  number  (Ri  =  N2  /u2)  does  not  exceed  1/4  [8],  and  direct  numerical  simulations  have 
tracked  the  instabilities  then  produced,  starting  with  the  formation  of  Kelvin-Helmholtz 
billows  that  overturn  the  stratification  gradient,  and  followed  by  production  of  streamwise 
eddies  [12].  However,  this  Richardson  number  criterion  is  known  not  to  be  an  accurate 
predictor  of  instability  in  other  flow  configurations  [3],  and  the  Reynolds  numbers  for  which 
accurate  simulations  of  the  instability-induced  mixing  are  feasible  remain  an  order  of  mag¬ 
nitude  below  those  seen  in  the  ocean.  Regardless  of  the  obscurity  of  the  conditions  needed 
for  instability  and  of  the  kinematics  of  mixing,  experiments  [8]  and  observations  of  atmo¬ 
spheric  clear-air  turbulence  [3,  figure  5]  give  a  clear  and  consistent  picture  of  the  effect  of 
mixing  upon  the  stratification  in  a  fluid:  compact  patches  of  well-mixed  fluid  are  produced 
(with  stratification  obliterated)  and  gravity  waves  shed  into  the  surrounding  medium.  The 
mixed  patches  are  typically  surrounded  by  layers  of  steeply  stratified  fluid,  giving  rise  to  an 
easily  identified  “rabbit-ear”  signature  in  radiosonde  studies  of  the  thermal  profile,  which 
would  correspond  to  sharp  spikes  in  N2  in  our  system. 

We  introduce  a  simple  diffusive  model  for  the  mixing  of  fluid  by  a  breaking  gravity  wave. 
Mixing  is  taken  to  occur  whenever  the  fluid  becomes  gravitationally  unstable  (so  that  at 
some  site  0Z  <  0),  with  no  accounting  for  shear  enhancement.  The  mixing  time-scale  is 
assumed  to  be  much  smaller  than  the  period  of  the  wave  that  triggered  mixing,  so  that  the 
continuing  evolution  of  the  wave-field  can  be  halted  while  mixing  occurs.  For  simplicity, 
mixing  is  assumed  only  to  redistribute  fluid  mass  so  that  the  Eulerian  distribution  of  velocity 
is  frozen  in  during  mixing.  This  is  unphysical,  but  allows  the  question  of  parametrising  the 
turbulent  dissipation  of  kinetic  energy  to  be  side-stepped.  To  select  a  diffusive  model  we 
impose  the  following  constraints: 

(i)  Mixing  must  be  energy-dissipative.  Since  the  velocity  field  is  unaffected  by  mixing, 
this  means  that  the  available  potential  energy  must  decrease  monotonically  with  time. 

(ii)  Mixing  zones  must  have  compact  support,  and  must  include  all  regions  of  fluid  in 
which  0Z  <  0. 

(iii)  Density  must  be  exactly  conserved  at  all  times. 

(iv)  Mixing  must  terminate  upon  reaching  a  stably  stratified  state.  The  mixing  scheme 
should  produce  well-mixed  zones,  rather  than  set  up  a  stable  stratification. 

(v)  The  “rabbit  ear”  structure  should  be  reproduced  in  the  layers  of  fluid  surrounding 
mixed-zones. 

Constraints  (ii)-(iv)  point  towards  a  diffusive  scheme  in  which  the  diffusive  flux  is  propor¬ 
tional  to  the  gradient  in  the  potential  temperature  rather  than  buoyancy  (that  is  J  oc  —  V0, 
rather  than  oc  — V6).  A  simple  candidate  scheme  has: 

Al 

—  =V-(D[0Z]V6)  with  D[8Z]  =  -H(-ez)6z  .  (21) 

Here  the  H(x)  is  the  Heaviside  function,  and  we  have  introduced  a  mixing-time  variable 
T.  In  our  scaled  geometry  V  =  (ejy,  Jj)  .  Check  that  property  (i)  is  satisfied:  Multiplying 
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Figure  3:  Diffusive  scheme  applied  to  the  unstable  buoyancy  profile:  6(x,  z)  =  0.3exp[— ((x— 
0.5)2  +  (z  -  0.5)2)/0.09]  cos[27t(x  +  z)}.  (a)  (b)  are  surface  plots  of  the  potential  temperature 
distribution  before  and  after  mixing,  (c)  and  (d)  give  potential  temperature  profiles  on  the 
transects  x  ~  0,6  and  z  =  0.6  respectively.  The  blue  curve  is  the  profile  before  mixing,  and 
the  green  curve  is  the  profile  after  mixing. 


both  sides  of  (21)  by  b  and  integrating  over  the  entire  of  the  fluid  domain,  we  have 

jt\J  b2dxdz  =  -J  D[6Z]  +  \Vb\A  dxdz  <  0  (22) 

since  by  construction  D  =  0  except  where  db/dz  <  —1.  In  integrating  by  parts  and  dis¬ 
carding  boundary  contributions,  we  have  made  tacit  use  of  the  fact  that  periodic  boundary 
conditions  will  be  imposed  upon  b.  Physically  we  expect  strong  diffusion  initially  in  zones 
where  9Z  <  0,  but  that  diffusivity  will  bleed  away  with  time,  leaving  patches  of  uniform 
0  (in  z  if  not  in  x).  An  example  of  the  application  of  this  diffusive  mixing  scheme  to  an 
initially  unstable  density  profile  is  shown  in  Figure  3.  Note  that  for  typical  aspect  ratios 
(e  «  0.1)  density  is  almost  conserved  at  each  x-station.  At  x-stations  with  a  single  density 
inversion  (i.e.  interval  in  which  6Z  <  0)  this  means  that  the  mixing  produces  a  Maxwell- 
type  construction,  with  the  density  made  uniform  in  the  smallest  z-interval  that  contains 
the  unstable  zone,  while  conserving  total  fluid  mass  and  giving  a  continuous  final  density 
distribution  (see  Figure  3c).  The  weakness  of  density  diffusion  between  x-stations  means 
that  density  distribution  is  markedly  less  smooth  on  horizontal  sections  than  on  vertical 
sections  (sec  Figure  3d).  This  is  intuitively  appealing:  although  the  breaking  inertiogravity 
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waves  respect  the  x  and  z  scalings  introduced  here,  Coriolis  forces  act  only  weakly  upon  the 
turbulent  eddies  generated  during  wave  breaking,  so  that  we  expect  mixing  to  be  isotropic 
in  the  unsealed  x  and  2  coordinates. 

Note  that  condition  (v)  is  not  satisfied  by  the  diffusive  scheme  (21)  which  always  pro¬ 
duces  continuous  6  profiles,  and  does  not  in  general  enhance  the  stable  6  gradients  surround¬ 
ing  a  mixing  zone.  One  remedy  for  this  would  be  to  extend  the  support  of  the  diffusivity 
function  D  to  include  some  region  of  stably  stratified  fluid.  Ensuring  that  this  is  com¬ 
patible  with  the  dissipative  condition  (i)  is  difficult.  In  the  direct  numerical  simulation 
literature  the  Thorpe  displacement  is  sometimes  invoked  for  this  purpose  [12].  The  Thorpe 
displacement  d(z ;  x )  is  defined  for  the  column  of  fluid  occupying  each  of  the  x-stations,  as 
the  minimum  distance  that  the  fluid  particle  at  z  must  be  moved  in  a  vertical  reordering  of 
the  fluid  particles  in  the  column  to  create  a  stable  stratification.  It  has  been  suggested  that 
at  any  instant  the  region  in  which  turbulent  overturning  must  occur  can  be  identified  with 
the  part  of  the  fluid  having  non-zero  Thorpe  displacement  [2].  However,  one  may  easily 
construct  examples  in  which  diffusion  over  the  entire  zone  of  non-zero  Thorpe  displacement 
would  lead  to  a  gain  in  available  potential  energy,  in  violation  of  condition  (i).  A  more 
promising  approach  attempts  a  more  careful  budgeting  of  the  energy  available  for  mixing 
from  both  the  kinetic  energy  and  available  potential  energy  of  the  flow.  The  diffusivity  D  is 
identified  with  the  amount  of  turbulent  energy  present,  and  is  allowed  to  self-diffuse.  Zones 
of  fluid  in  which  6Z  ^  0  are  treated  as  diffusivity  sources  and  sinks  respectively.  While  these 
models  allow  diffusion  over  significantly  larger  fluid  regions  than  (21)  and  may  therefore 
satisfy  (v),  and  can  be  constructed  so  as  to  conserve  [14]  or  dissipate  energy,  they  also 
require  the  addition  of  multiple  ill-constrained  parameters  for  the  separate  diffusivities  of 
density,  momentum  and  turbulent  energy. 

4  Numerical  implementation 

A  cartoon  of  the  numerical  scheme  for  combining  linear  evolution  (§2)  with  diffusive  mixing 
(§3)  is  given  in  Figure  4. 

We  describe  briefly  some  of  the  numerical  desiderata.  We  impose  periodic  boundary 
conditions  upon  the  6,  x/j  and  v  fields,  and  discretise  the  numerical  domain  with  a  grid  of 
M  points  in  the  x  direction  and  N  points  in  the  ^-direction.  Typically  we  let  M  and  N 
range  from  32  up  to  256.  Fast  Fourier  Transforms  are  used  to  pass  between  physical  and 
wavenumber  representations  of  the  wave  fields.  The  time  interval  At  over  which  the  fields 
are  allowed  to  evolve  between  mixing  events  is  held  fixed  throughout  the  simulation,  so  that 
the  evolution  of  the  unforced  components  can  be  determined  in  advance  by  the  computation 
of  time  evolution  operators  exp(=fia?(fc)At)  for  each  of  the  modes.  Stochastic  evolution  of 
the  forced  modes  requires  us  to  generate  the  Gaussian  random  variables  X  and  Y  that 
feature  in  the  equation  (18).  We  do  this  by  calculating  the  covariance  matrix  (denoted  by 
C(At\k))  for  (5ftAr,  QA,  QT),  and  finding  its  Cholesky  decomposition  C  =  LLT.  The 
requisite  A,  Y  at  each  time  step  may  then  be  generated  as  (3? A,  QA,  5ft  T,  SsY)  =  L£,  where 
£  is  a  quadruple  of  JV(0, 1)  random  variables. 

For  the  implementation  of  the  diffusive  mixing  step,  spatial  derivatives  are  approximated 
by  second  order  centred  differences,  and  the  time  stepping  is  performed  with  a  fully  implicit 
second  order  scheme  (the  Matlab  routine  ode23s,  based  on  the  Rosenbrock  formula,  which 
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Figure  4:  A  numerical  scheme  for  combining  linear  evolution  with  diffusive  mixing. 


we  have  altered  to  make  use  of  UMFPACK  to  perform  an  LU-factorisation  of  the  large  but 
sparse  Jacobian  matrix).  To  ensure  that  the  associated  system  of  time  equations  have  a 
well-defined  Jacobian  it  is  necessary  to  smooth  the  Heaviside  function  term  appearing  in 
the  diffusivity.  In  practice  we  use: 

H(x)  *  I  (x  +  Vx2  +  4e)  ,  (23) 

where  the  smoothing  length  e  is  set  at  machine  precision  e  &  10~12  without  any  evident 
irregularity  in  the  running  of  the  code.  Diffusion  was  terminated  when  9Z  exceeded  some 
critical  value  (typically  -0.005)  throughout  the  fluid  domain.  Numerical  results  for  the 
mixing  step  were  tested  using  a  finite  element  package  (COMSOL  Multiphysics  3.2). 

There  are  two  fundamentally  different  experiments  that  can  be  performed  using  the 
numerical  scheme  described  here.  In  the  first,  ■  relaxation ,  all  wavenumbers  are  initially 
given  identical  energies,  randomly  allocated  between  leftward  and  rightward  propagating 
modes,  and  with  uniformly  randomly  distributed  phases  for  each  component.  The  system  is 
then  allowed  to  evolve  without  forcing  until  it  reaches  equilibrium  with,  in  the  end  stages, 
exponential  decay  in  the  total  energy,  and  increase  in  the  waiting  time  between  mixing 
events.  In  the  second  experiment,  build-up ,  one  or  two  of  the  gravest  modes  of  the  system 
are  supplied  with  white  noise  forcing,  and  the  transmission  of  energy  from  these  modes  to 
other  modes  is  charted. 
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Figure  5:  (a)  Effect  of  varying  initial  state.  The  three  curves  give  relaxation  dynamics  of 
system  starting  with  different  random  initial  states,  (b)  Green  data  set  replotted  on  log-log 
scale.  All  simulations  are  run  with  M  =  N  =  64. 


5  Results 

5.1  Relaxation  experiments 

The  effect  of  the  initial  state  upon  the  evolution  of  the  system  is  shown  in  Figure  5a.  Only 
the  energy  in  propagating  modes  is  plotted  -  the  contribution  from  geostrophically  balanced 
modes  is  an  order  of  magnitude  smaller.  In  part  b  of  the  figure,  one  of  the  data  sets  is 
replotted  on  a  linear-log  scale  to  show  the  exponential  convergence  of  the  total  wave  energy. 
It  can  be  seen  that  the  initial  state  is  not  forgotten,  but  helps  to  determine  the  total  energy 
that  the  system  relaxes  to.  Two  systems  with  initially  closely  separated  energies  ultimately 
equilibrate  with  similar  energies,  as  the  red  and  blue  curves  show. 

Phase  structure  in  evolved  states.  It  must  be  asked  whether  the  equilibrium  states  of 
the  system  have  definite  phase  as  well  as  energy  spectra  -  i.e.  that  the  different  wave 
components  must  have  specific  phase  lags  to  avoid  constructive  interference  that  may  lead 
to  breaking.  We  test  for  this  by  taking  one  of  the  late  time  system  states  from  Figure  5a, 
randomising  all  of  the  phases  and  allowing  it  to  evolve  with  time,  to  see  if  the  equilibrium 
is  altered.  Results  are  shown  in  Figure  6.  It  is  seen  that  the  apparent  equilibrium  energy 
of  the  propagating  modes  (which  is  found  by  fitting  the  energy-time  curve  to  a  decaying 
exponential  and  extrapolating  to  infinity)  varies  by  less  than  2%.  This  suggests  that  the 
equilibrium  phase  spectrum  of  the  system  is  white. 

The  effect  of  varying  the  time-interval  between  mixing  events  is  shown  in  Figure  7,  in 
which  an  identical  initial  state  is  let  evolve  three  times,  with  different  time  intervals  At 
between  mixing  events  for  each  of  the  iterations.  The  energy  of  the  system  is  sensitive 
at  early  times  to  the  value  of  At,  but  not  the  value  of  energy  that  the  system  ultimately 
converges  to.  Smaller  values  of  At  give  faster  convergence  to  the  equilibrium  energy. 

Evolved  spectra .  In  Figure  8  we  compute  the  detailed  distribution  of  energy  among  wave- 
modes  for  the  green  data  set  from  Figure  5.  The  spectra  corresponding  to  other  data  sets  are 
qualitatively  similar,  and  we  are  developing  methods  for  direct  comparison.  The  redness  of 
the  spectrum  is  similar  to  that  of  the  Garrett-Munk  spectrum,  although  lack  of  resolution 
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Figure  6:  Effect  of  randomisation  of  phase  of  wave  modes  upon  the  relaxation  of  a  system 
to  equilibrium.  Green  curve  shows  initial  evolution  (identical  to  the  green  data  set  in 
Figure  5),  and  purple  curve  the  continuing  evolution  after  phases  are  randomised  at  a  time 
t  =  3.3  x  104. 


Figure  7:  Effect  of  varying  time  interval  between  mixing  events  upon  relaxation  dynamics. 
Red  curve  corresponds  to  A t  =  13.40,  blue  curve  to  At  =  4.46,  and  cyan  curve  to  At  =  1.12 
(all  times  are  measured  in  units  of  1/AT). 
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Figure  8:  Relaxation  spectra  in  (a)  (|fc|,  |m|)  and  (b)  (|m|,  |tu|)  space. 


prevents  direct  comparison  of  the  scalings  for  the  inertial  peaks.  The  spectrum  is  also 
peaked  at  small  m  (corresponding  to  purely  buoyancy  driven  waves)  in  visible  disagreement 
with  the  GM  spectrum.  These  modes  correspond  to  lifting  of  vertical  columns  of  fluid,  and 
do  not  therefore  participate  in  breaking:  either  in  determining  whether  breaking  will  occur, 
or  in  mixing,  because,  as  was  discussed  in  §3,  this  mainly  leads  to  vertical  transport  of  mass. 
The  persistence  of  these  modes,  once  excited,  is  a  knotty  problem  for  the  model. 


5.2  Build-up  experiments 


It  is  not  feasible  to  force  a  single  mode  of  the  system,  since  any  spatial  periodicity  of 
the  forced  mode  will  be  inherited  by  the  modes  created  during  wave  breaking,  leading 
to  a  sparse  energy  spectrum.  To  break  this  symmetry  we  force  a  pair  of  grave  modes 
(fe,  m)  =  (27r,±27r)  with  the  same  forcing  constant  to,  and  the  first  mode  initially  just 
below  its  breaking  amplitude  and  the  second  mode  started  from  zero  amplitude.  It  is  also 
necessary  to  impose  an  adiabaticity  constraint  upon  the  forcing,  that  the  time  taken  for  the 
forcing  to  bring  the  forced  mode  to  breaking  must  greatly  exceed  the  period  of  the  mode, 


i.e.  that: 

/ cuV2  m<1  +  f2k2 

T°  ^  V2tt/  e2fcm 


(24) 


For  tq  significantly  greater  than  this  threshold  value,  the  energy  in  the  forced  modes  is 
observed  to  grow  without  bound.  Wave-breaking  only  removes  energy  from  the  buoyancy 
field,  and  we  can  only  be  assured  that  energy  is  equipartitioned  between  the  available 
potential  energy  (plus  the  out-of-plane  kinetic  energy)  and  the  in-plane  kinetic  energy  if  the 
adiabaticity  condition  is  met. 

It  can  be  seen  that  the  energy  spectrum  is  dominated  by  the  handprint  of  the  forced 
modes.  These  modes  remain  saturated  (at  the  brink  of  breaking)  and  transmission  of  energy 
to  other  modes  is  inefficient.  In  Figure  9a  we  show  the  total  energy  budget  for  one  realisation 
of  this  system,  showing  that  that  it  attains  a  flux-dissipative  equilibrium  (with  the  rate  of 
dissipation  by  mixing  equal  to  the  energy  input  from  the  white  noise).  Experimentally  the 
mean  energy  value  in  this  equilibrium  depends  upon  the  particular  modes  being  forced, 
but  not  upon  the  strength  of  the  forcing  tq  or  upon  the  time  between  mixing  events  At. 
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Figure  9:  (a)  Energy  budget  for  forced  mode,  showing  rate  of  energy  input  from  white  noise, 
against  total  energy  of  all  wave  modes,  (b)  Time-averaged  energy  spectrum  when  system  has 
attained  flux-dissipative  equilibrium.  Simulations  were  run  on  an  older  version  of  the  code, 
on  an  M  =  N  =  25  grid,  with  unnormalised  energies  (which  must  be  divided  by  M2N 2  =  220 
for  comparison  with  Figure  5)  and  anisotropic  diffusion  (setting  V  =  (d/dx:  d/dz)  in  (21)), 
but  are  in  qualitative  accord  with  experiments  using  the  modified  code  described  in  this 
report. 


The  rate  of  energy  increase  in  unforced  modes  is  slow,  and  it  is  unclear  whether  a  forced- 
dissipative  equilibrium  has  actually  been  reached  by  the  end  of  the  simulation.  In  truth 
Figure  9a  probably  represents  no  more  than  the  achievement  of  a  flux-dissipative  equilibrium 
for  the  single  forced  mode  of  the  system.  In  Figure  9b  we  show  the  (coarsely- binned)  energy 
spectrum  of  the  system  as  a  function  of  wave-number,  showing  clear  peaking  at  the  forced 
wavelengths.  It  may  be  possible  to  clarify  whether  equilibrium  has  been  attained  by  running 
a  hybrid  of  the  relaxation  and  build-up  experiments,  in  which  the  other  modes  are  given 
some  initial  energy  and  allowed  to  relax  to  rather  than  build  up  to  a  steady  state. 

6  Discussion 

Basic  questions  about  the  capabilities  and  limitations  of  the  model  remain  unasked.  The 
preliminary  simulations  described  here  show  that  a  large  number  of  energy  equilibria  can 
be  accessed  when  an  unforced  wave-field  is  allowed  to  relax  from  some  higher-energy  initial 
state.  These  states  appear  to  all  have  structure-free  phase  spectra  and  relatively  homolo¬ 
gous  energy  spectra,  suggesting  that  the  family  of  equilibrium  states  could  be  parametrised 
by  their  total  energy,  and  we  are  in  the  process  of  running  simulations  to  simulate  relaxation 
for  a  large  assay  of  initial  states  at  a  higher  resolution,  in  order  to  confirm  this.  The  problem 
of  the  persistence  of  m  =  0  states  remains  unresolved.  Results  of  the  build-up  simulations 
are  less  promising  -  a  method  must  be  found  to  subtract  out  the  forced  mode  from  the 
final  spectra  (Figure  9),  or  for  hastening  the  equilibration  of  these  modes.  Incorporation  of 
forcing  is  vital  to  our  efforts  to  use  the  model  to  tackle  the  problem  of  anomalous  diffusion 
(§1.1),  since  the  rate  of  mixing  will  be  ultimately  controlled  by  the  rate  of  energy  input  into 
the  wave-field. 
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1  Introduction 

1.1  Oceanic  background 

Mesoscale  vortices  have  recently  been  recognized  to  play  an  important  role  in  redistribution 
and  transport  of  water  properties  (e.g.  temperature,  salinity)  around  the  oceans.  The 
interaction  of  vortices  with  seamounts,  submerged  ridges,  or  islands  might  result  in  enhanced 
and  localized  transfer  of  anomalous  fluid  from  the  vortices  to  the  surrounding  environment. 
In  addition,  the  interaction  could  end  in  the  formation  of  new  vortices  downstream  otherwise 
complete  destruction  of  the  incident  vortices.  This  topic  has  been  investigated  for  the  past 
several  decades  for  e.g.  Meddies  in  the  eastern  North  Atlantic,  Agulhas  rings  in  the  eastern 
South  Atlantic,  and  North  Brazil  Current  (NBC)  rings  in  the  western  tropical  Atlantic.  In 
the  current  study,  we  will  focus  on,  in  particular,  the  behaviour  of  the  last  kind  of  vortices, 
NBC  rings  which  interact  with  the  Lesser  Antilles. 

It  is  believed  that  NBC  rings  are  one  of  the  leading  mechanisms  for  transporting  the  up- 
per  ocean  equatorial  and  South  Atlantic  water  into  the  North  Atlantic  as  part  of  the  Merid¬ 
ional  Overturning  Cell  (MOC).  The  MOC  transports  cold  deep  water  southward  across  the 
equator  and,  to  be  balanced,  transports  upper  ocean  South  Atlantic  waters  northward.  In 
the  upper  layers,  the  NBC  is  a  northward  flowing  western  boundary  current  that  carries 
warm  water  across  the  equator  along  the  coast  of  Brazil  (Figure  1).  Near  5°  —  10°  A, 
the  NBC  separates  sharply  from  the  coastline  and  retroflects  to  feed  the  eastward  North 
Equatorial  Counter  Current  (NECC)  [7].  During  its  retroflection,  the  NBC  occasionally 
pinches  off  isolated  anticyclonic  warm-core  vortices  exceeding  450  km  in  overall  diameter, 
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2  km,  in  vertical  extent,  and  swirling  at  speed  approaching  100  cms~1.  These  NBC  rings 
move  north-westward  toward  the  Caribbean  at  8  -  17  cm,s~l  on  a  path  parallel  to  the  coast¬ 
line  of  Brazil.  As  part  of  the  MOC,  in  most  cases,  they  then  interact  with  a  complex  island 
chain,  the  Antilles  islands  [6]  and  enter  the  Caribbean  Sea.  (Episodically,  they  enter  the 
North  Atlantic  subtropical  gyre.)  The  inflow  into  the  Caribbean  Sea  ultimately  feeds  the 
Florida  Current  which  is  now  recognized  to  be  a  fundamental  passage  for  northward  trans¬ 
port  of  upper  ocean  waters  in  the  global  thermohaline  circulation.  Therefore,  the  Atlantic 
MOC  (hence  NBC  rings)  is  an  important  element  of  the  global  thermohaline  circulation 
and  a  fundamental  component  of  the  global  climate  system.  Recent  observations  reveal 
that  relatively  large  (average  diameter  200  km)  energetic  anticyclonic  vortices  were  found 
downstream  of  the  Antilles  islands  in  the  Eastern  Caribbean  Sea  and  translated  westward  in 
the  central  part  of  it  whereas  cyclonic  vortices  were  observed  primarily  near  boundaries  in 
the  Eastern  Caribbean  Sea  [11]  (Figure  2).  Unfortunately,  it  is  difficult,  by  observations,  to 
know  whether  or  not  such  large  anticyclonic  and  cyclonic  vortices  observed  in  the  Eastern 
Caribbean  Sea  have  been  produced  as  a  consequence  of  the  interaction  between  NBC  rings 
and  the  Antilles  islands,  and  if  so,  how  they  have  been  formed.  In  the  present  work,  we 
shall  try  to  answer  part  of  this  question  through  laboratory  experiments. 


Figure  1:  Sketch  of  the  upper-ocean  circulation  in  the  western  tropical  Atlantic  from  [5], 


1.2  The  previous  works 

Before  mentioning  a  possible  mechanism  for  the  large  anticyclonic  and  cyclonic  vortices 
formation  in  the  Eastern  Caribbean  Sea,  let  us  introduce  briefly  two  previous  works  on 
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Figure  2:  Drift  trajectories  of  28  cyclones  (blue)  and  29  anticyclones  (red).  Anticyclones  seem  to  be 
dominant  in  the  Eastern  Caribbean  Sea  between  65° W  and  75 °1V,  from  [11]. 

interaction  of  vortices  with  multiple  islands. 

The  interaction  of  a  monopolar,  self-propagating  cyclonic  vortex  with  two  circular  cylin¬ 
ders  was  investigated  in  the  laboratory  [2]  (Figure  3  (a)).  Typically  after  the  vortex  came 
in  contact  with  the  two  cylinders,  the  outer  edge  of  the  vortex  was  peeled  off  and  a  so-called 
“streamer”  (or  two  “streamers”)  went  around  one  of  the  cylinders  (or  each  of  the  cylinders) 
(Figure  3  (b)).  When  the  streamer  velocity  vs  was  large  enough  (i.e.  400  <  Re  <  1100 
where  Re  =  vsd/u ,  and  d  is  the  diameter  of  the  incident  vortex),  the  “streamer (s)”  turned 
into  a  new  cyclonic  vortex  (or  two  new  vortices).  During  the  experiments  in  [2],  three 
parameters  were  varied:  G,  the  separation  between  the  cylinders;  d\  and  Y,  the  perpen¬ 
dicular  distance  of  the  center  of  the  vortex  from  an  axis  passing  through  the  center  of  the 
gap  between  the  cylinders  (see  Figure  3  (a)).  One  of  the  remarkable  observations  in  [2]  is 
that  the  flow  within  the  vortex  was  “funneled”  between  the  two  cylinders  and  formed  a 
dipole  vortex,  much  like  water  ejected  from  a  circular  nozzle  generates  a  dipole  ring.  This 
behaviour  occurred  provided  that  —2  <  Y/g  <  0,  0.25  <  G/d  <  0.4,  and  Rec  >  200,  where 
g  —  G/ 2,  Reg  =  UgG/u  is  the  Reynolds  number  based  on  a  length  scale  ~  O(G),  and  Uq 
is  the  maximum  velocity  of  the  vortex  fluid  in  the  gap.  The  size  of  the  created  cyclonic  and 
anticyclonic  vortices  (i.e.  a  dipole)  was  smaller  than  that  of  the  original  vortex. 

A  second  relevant  work  is  a  numerical  investigation  of  the  interaction  of  both  a  self- 
propagating  and  an  advected  vortex  with  multiple  islands  [13].  The  islands  were  represented 
by  thin  vertical  walls  aligned  in  the  North-South  direction  with  gaps  having  a  width  of 
20  %  of  the  vortex  diameter.  This  study  showed  that  if  the  individual  islands  were  small 
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compared  with  the  vortex  radius  (e.g.  L/R 4  =  0.3  where  L  is  the  island  length,  Ri  is  the 
initial  vortex  diameter1),  the  vortex  reorganized  in  the  basin  downstream  of  the  islands, 
whereas  it  always  split  into  multiple  offsprings  if  the  islands  were  large  (e.g.  L/R 4  =  1.5) 
(Figure  4).  Moreover,  intense  vortices  experienced  relatively  greater  amplitude  loss  than 
weak  vortices.  The  results  of  [13]  may  give  an  account  of  the  observations  of  anticyclones  in 
the  Eastern  Caribbean  Sea,  but  the  generation  of  cyclones  in  the  Sea  can  not  be  explained 
by  their  results  as  no  cyclones  were  seen  in  [13]. 


Figure  3:  (a):  sketch  illustrating  the  geometry  of  the  encounter  between  the  vortex  and  two  cylinders, 
from  [2].  The  diameter  of  the  cylinders,  D,  is  5  cm.  (b):  sketch  of  a  streamer,  from  [3]. 


Figure  4:  Multiple-islands  numerical  experiments  from  [13].  (a):  L/R ?  =  0.3.  (b):  L/R4  =  1.5. 


1.3  Hypothesis 

Since  the  Lesser  Antilles  have  passage  width  between  30  —  60  km  and  the  approaching  NBC 
vortices’  size  varies  between  200  —  400  km.  G/d  lies  in  the  range  0.07  —  0.3.  Although  this 

Mhe  definit  ion  of  the  vortex  initial  radius  is  not  stated  in  [13],  hence  Rr  and  d  (defined  in  [2])  are  not 
necessarily  equal. 
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Figure  5:  Sketch  illustrating  a  possible  formation  mechanism  for  the  large  Eastern  Caribbean  vortices  [11]. 

range  does  not  exactly  fit  in  0.25  <  G/d  <  0.4  obtained  in  [2],  it  is  natural  to  anticipate  that 
dipole  formation  is  likely  to  occur  downstream  of  the  Lesser  Antilles’  passages.  Assuming 
that  several  pairs  of  dipoles  are  formed  at  western  side  of  the  islands,  we  expect  that 
transition  from  small  scale  vortices  to  large  scale  structures  will  occur  by  the  merging  of 
vortices  of  like  sign  (Figure  5).  When  rotation  is  present,  the  scale  to  which  the  vortices 
grow  is  determined  by  instability  processes  that  inhibit  vortices  to  grow  to  scales  larger 
than  the  Rossby  radius  of  deformation  [9].  The  coalescence  of  same  sign  vortices  is  similar 
to  the  well-known  feature  of  inverse  energy  cascade  in  two-dimensional  flow  [10].  Finally, 
vortices  having  a  diameter  of  the  order  of  the  Rossby  radius  of  deformation  will  form  and 
drift  westward  due  to  the  planetary  /?-plane  (Figure  5). 

2  The  experiments 

2.1  Experimental  apparatus 

The  experiments  were  performed  in  a  square  tank  of  depth  45  cm,  length  and  width  of 
115  cm.  Both  ‘top- view’  and  ‘side-view’  illustrations  of  the  apparatus  are  shown  in  Figures 
6  &  7.  Some  experiments  were  carried  out  in  a  much  smaller  tank  (depth  36  cm,  length  and 
width  60  cm).  However,  as  we  are  interested  in  knowing  not  only  whether  or  not  several 
dipoles  are  formed  when  vortices  interact  with  a  chain  of  obstacles,  but  also  the  fate  of  the 
dipoles  (if  they  are  really  formed),  it  was  appropriate  to  focus  on  experiments  performed 
in  the  larger  tank.  The  apparatus  in  Figures  6  &  7  was  mounted  concentrically  on  a  2  m- 
diameter  rotating  turntable  with  a  vertical  axis  of  rotation.  The  sense  of  rotation  of  the 
turntable  was  anticlockwise.  A  square  tank  was  used  to  avoid  optical  distortion  from  side 
views  associated  with  a  circular  tank.  The  tank  had  a  sloping  bottom  which  makes  an  angle 
a  to  the  bottom  of  the  tank  in  order  for  a  vortex  to  self-propagate  leftward  when  looking 
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Figure  6:  Sketch  of  the  experimental  apparatus:  top  view. 


Figure  7:  Sketch  of  the  experimental  apparatus:  side  view. 


upslope  [4].  We  note  that  there  is  an  analogy  between  the  /3-plane  effect  and  the  sloping 
topography  effect  provided  that  the  angle  of  the  slope  a  and  the  Rossby  number  Ro  (the 
ratio  of  the  advection  term  to  the  Coriolis  term  in  the  horizontal  momentum  equations)  are 
sufficiently  small  [4].  The  shallowest  part  of  the  tank  corresponds  to  North.  Hence,  East 
is  to  the  right  when  looking  upslope,  West  is  to  the  left,  and  South  is  the  deepest  part  of 
the  tank.  The  tank  was  filled  with  fresh  water,  which  was  initially  in  solid  body  rotation. 
Seven  circular  cylinders  whose  diameter  is  D  were  aligned  in  the  North-South  direction,  and 
each  of  them  was  separated  by  a  gap  G  as  shown  in  Figures  6.  The  position  of  the  central 
cylinder  (the  fourth  one  from  North  (or  South))  was  always  fixed.  However,  the  position  of 
the  other  cylinders  could  be  changed  to  vary  the  value  G.  The  depth  of  the  water  at  the 
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Figure  8:  The  five  configurations  of  the  obstacles  used  in  the  experiments.  The  position  of  the  fourth 
cylinder  from  North  (or  South)  was  kept  fixed. 

central  cylinder,  ho,  was  chosen  to  be  11  cm  which  was  much  larger  than  the  Ekman  layer 
depth  6Ek  =  \/2 v/f  ss  3  mm,  where  v  is  the  kinematic  viscosity  of  the  water  and  /  is  the 
Coriolis  parameter.  The  bottom  of  each  cylinder  was  sliced  at  an  angle  so  it  rested  flush 
with  the  sloping  bottom. 

A  barotropic  cyclonic  vortex  was  generated  by  placing  an  ice  cube  in  the  water  [14], 
a  method  dynamically  similar  to  withdrawing  fluid  from  a  sink  positioned  on  the  sloping 
bottom.  The  water  surrounding  the  ice  cube,  due  to  conduction,  becomes  colder  than  the 
surrounding  water  and  sinks  as  a  cold  plume,  forming  a  cold  dense  lens  within  the  thin 
bottom  Ekman  layer.  The  dense  plume  induces  inward  velocities  along  the  entire  column 
depth  above  the  bottom  lens,  and  then,  influenced  by  the  Coriolis  force,  the  water  column 
(above  the  dense  lens)  starts  to  spin  cyclonically.  In  order  to  conserve  mass,  the  dense  fluid 
in  the  bottom  Ekman  layer  flows  radially  outwards  with  a  rapid  velocity  in  comparison  to 
the  rotation  period  of  the  tank  and  thus  a  dense  anticyclonic  vortex  does  not  form  on  the 
bottom.  The  fluid  within  the  dense  lens  moves  downslope  together  with  the  established 
barotropic  vortex  above  it.  Influenced  by  the  Coriolis  force,  both  the  cyclonic  water  column 
and  the  cold  lens  change  their  direction  and  start  drifting  westward  with  a  very  small 
meridional  displacement.  Although  NBC  rings  are  anticyclonic  vortices,  in  the  laboratory  it 
was  not  possible  to  reproduce  stable  barotropic  anticyclones  as  they  tend  to  be  centrifugally 
unstable  [8]  and  become  non-axisymmetric  in  a  few  rotation  periods.  Furthermore,  NBC 
rings  have  a  baroclinic  structure  and  move  within  a  stratified  fluid.  As  shown  by  [3],  the 
use  of  cyclonic  vortices  does  not  limit  the  generality  of  the  results,  which  can  be  easily 
extended  to  anticyclones.  In  particular,  the  circulation  equation  around  the  obstacle  and 
the  equation  relating  the  streamer  velocity  to  the  vortex  velocity  (the  equation  in  Figure 
3  (b))  still  hold  for  anticyclones.  We  neglected  the  effect  of  a  stratified  environment  and 
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the  influence  of  the  advection  mechanism  on  the  interaction  in  our  present  study.  Lack  of 
stratification  is  possibly  the  weakest  point  of  our  model  but  the  good  agreement  between 
the  results  obtained  in  [1,  2,  3]  and  the  oceanic  observations  suggests  that  stratification 
does  not  invalidate  the  relevance  of  the  results  discussed  here. 

For  all  the  experiments,  the  Coriolis  parameter  /  was  fixed  at  0.25  s_1,  and  v  = 
0.01  cm2s“1 .  The  bottom  slope  was  set  at  s  —  tan  a  =  0.5,  where  a  is  the  angle  between  the 
sloping  bottom  and  the  horizontal  so  that  the  self-propagating  vortex  could  move  westward 
with  a  speed  U  ~  0.2  cms~l.  The  vortex  was  produced  approximately  20  cm  westward  of 
the  eastern  wall  of  the  tank.  Hence,  the  vortex  moved  20  cm  westward  and  interacted  with 
the  chain  of  cylinders  before  the  spindown  time  r  =  ho/yfvj  ~  200  s.  The  diameter  of  the 
cylinders,  D ,  is  3.3  cm.  Three  values  for  the  size  of  the  gaps,  G  —  3,  1.5,  0.7  cm  and  five 
types  of  configurations  of  the  obstacles  (Figure  8)  were  studied.  The  azimuthal  velocity 
profile  of  the  vortex  in  the  experiments,  vq,  is  similar  to  that  of  a  Rankine  vortex  with  an 
approximately  constant  vorticity  (solid  body  rotation)  for  0  <  rf  <  rfmax  and  a  velocity 
which  decays  roughly  like  1  /rf  for  rl  >  Tfmax ,  where  r'  is  the  radial  coordinate  originating 
in  the  vortex  center.  We  define  the  vortex  radius  r  to  be  not  r'max  where  the  azimuthal 
velocity  of  the  vortex  is  maximum,  but  the  radial  distance  (from  the  center  of  the  vortex) 
where  the  velocity  has  decayed  by  approximately  30%  (i.e.  r  =  rfmax/Q.7).  This  definition 
for  the  vortex  radius  is  same  as  the  one  in  [1,  2].  The  incident  vortex  diameter  d  ranged 
between  7.6  —  19  cm  due  to  non-uniformity  of  the  size  of  the  ice  cubes  used. 

2.2  Measurements 

A  video  camera  was  mounted  above  the  tank  and  was  fixed  to  the  turntable  so  that  we  were 
able  to  observe  the  flow  in  the  rotating  frame.  For  half  of  the  experiments,  the  vortex  was 
made  visible  by  using  a  white  sloping  bottom,  dripping  dye  (food  coloring)  on  the  ice  cube 
and  adding  buoyant  paper  pellets  on  the  free  surface.  The  motion  of  the  dyed  vortex  was 
also  observed  from  the  side  of  the  tank.  For  the  rest  of  the  experiments,  the  paper  pellets  on 
the  free  surface  and  a  black  sloping  bottom  were  used  in  order  to  measure  the  velocity  field 
and  to  calculate,  for  instance,  the  circulation  of  the  vortex.  Images  were  grabbed  from  the 
recorded  video  tape  by  using  XCAP.  The  time  interval  between  each  image  was  0.15  5.  An 
image  processing  software,  DigiFlow  was  used  to  do  particle  tracking,  and  then  calculate 
the  velocity  field  by  mapping  the  individual  velocity  vectors  onto  a  rectangular  grid  using 
a  spatial  average  over  2  cm  and  the  time  average  over  10  5.  Once  the  gridded  velocity  was 
obtained,  quantities  such  as  the  position  (center),  the  radial  distance  rl  where  the  velocity 
is  maximum  (i.e.  rfmax ) ,  and  the  circulation  of  the  vortex  (before  and  after  the  interaction 
with  the  obstacles)  were  computed  by  using  Matlab.  Let  x,  y  be  the  zonal  and  meridional 
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coordinates  in  Figure  9,  respectively.  Further,  let  us  define  t/,  V  to  be  the  x- ,  y-component 
of  the  vortex  velocity,  respectively.  The  position  of  the  vortex  center  was  determined  as  the 
intersection  of  two  lines;  one  is  a  line  where  \U\  is  maximum  and  parallel  to  the  y-axis  while 
the  other  is  a  line  where  \V\  is  maximum  and  parallel  to  the  x-axis  (red  lines  in  Figure  9). 
Two  different  r^aa,  were  computed  by  defining  two  radial  distances  from  the  vortex  center, 
namely 

Tmax  =  max(ri,  r2,  r3,  r4)  ,  rav  =  average(ri,  r2,  r3,  r4)  , 

where  r\  is  the  radial  distance  from  the  vortex  center  to  the  eastward  location  where  \V\ 
is  maximum,  r<i  is  the  distance  from  the  vortex  center  to  the  northward  location  where 
\U\  is  maximum,  etc.  Defining  rmax,  rav  was  important  particularly  when  the  vortex  was 
distorted.  Since  rav  turned  out  to  be  better  in  general  (that  is,  \U\  and  \V\  are  almost 
maximum  at  r 7  —  rau),  we  focused  only  on  the  case  r'max  =  rav.  Once  rfmax  was  obtained, 
the  circulations  of  the  vortex  T#,  Ta  (before  and  after  the  interaction,  respectively)  were 
computed: 

r#  =  A'y'ujj , 

i 

where  A  is  the  area  of  the  single  grid  square,  and  is  the  relative  vorticity  at  each  grid 
point  within  rf  =  rlmax  =  rav^B  (r av,B  =  r av  before  the  interaction).  A  similar  expression 
can  be  written  for  F^.  By  looking  at  the  video  tape,  it  is  possible  to  know  the  number  of 
the  offsprings  N ,  whether  or  not  a  dipole  was  formed  (or  several  dipoles  were  formed),  and 
whether  there  was  a  backward  flow  (fluid  flowing  between  the  cylinders  from  west  to  east) 
or  not. 

3  Experimental  results 

3.1  G  =  3  cm 

Let  us  firstly  discuss  the  results  for  G  =  3  cm  with  d  —  7.6  ~  19  cm  (0.16  <  G/d  <  0.4) 
because  it  corresponds  to  the  kind  of  geometry  found  when  the  NBC  rings  interact  with 
the  Lesser  Antilles.  As  soon  as  a  vortex  encountered  the  obstacles,  a  dipole  almost  always 
formed  for  all  the  configurations  showed  in  Figure  8.  However,  the  formation  of  two  or  more 
dipoles  never  occurred  although  the  vortex  extended  for  several  cylinder  and  gap  lengths. 
After  a  dipole  formed,  the  cyclonic  part  of  the  dipole  became  dominant.  Depending  on  the 
configurations  of  the  obstacles  and  the  initial  vortex  position,  a  relatively  large  cyclonic 
offspring  was  produced  either  directly  from  the  cyclonic  part  of  the  dipole,  or  from  the 
“remnant”  of  the  original  vortex  at  the  gap  positioned  just  “South”  of  the  gap  where  the 
dipole  was  formed.  (The  vortex  moved  south  due  to  its  image  vortex.  The  degree  of 
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Figure  9:  Velocity  (arrows)  and  vorticity  (colors)  fields  of  a  typical  vortex  in  the  experiments.  The 
intersection  of  the  red  lines  is  the  center  of  the  vortex.  ( i  —  1  ...4)  are  perpendicular  distances  from 
t  he  center  to  the  green  lines.  The  circulation  based  on  rQV  is  the  sum  of  the  vorticity  at  each  of  the  yellow 
triangles. 

the  southward  movement  of  the  vortex  depends  on  the  configurations.)  The  number  of 
offsprings,  TV,  was  1  in  most  cases,  rarely  2,  and  never  0.  The  formation  of  large  vortices 
downstream  of  the  obstacles  is  surprising  as  the  gap  width  was  only  16  ~  40%  of  the 
initial  vortex  diameter.  Figures  10  &  11  show  two  laboratory  experiments  with  G  =  3 
cm.  Figure  10  shows  the  velocity  and  vorticity  fields  for  a  configuration  7,  while  Figure 
11  shows  an  experiment  with  configuration  3  in  which  a  white  sloping  bottom  and  dye 
were  used  to  visualize  the  flow.  By  Figures  12  Sz  13,  our  experimental  results  and  the 
numerical  observations  [13]  discussed  in  §1.2  can  be  compared.  According  to  Figure  12,  the 
relative  reduction  of  vortex  intensity  tends  to  be  large  (small  T a/Lb)  for  intense  vortices 
(large  F#),  as  observed  in  [13].  Figure  13(a)  (Figure  13  (b))  is  a  plot  of  ‘DiSi/rav^B  vs 
TV1  (‘ DiSi/rmuB  vs  Backward  flow’),  where  DiSi  is  the  total  length  of  the  ‘middle’  island 
(e.g.  Disi  =  D  =  3.3 cm  for  configuration  3,  DiSi  =  2D  =  6.6cm  for  configuration  4  etc). 
So  Disj I r(nu  b  is  equivalent  to  L/Rx  used  in  [13].  Figure  13  (a)  shows  that,  in  most  cases, 
N  =  1,  independently  of  the  value  of  DiS\/rQV,B  (or  L/Ri).  This  result  is  in  disagreement 
with  the  main  result  of  [13].  On  the  contrary,  Figure  13  (b)  agrees  with  a  result  of  [13] 
because  there  is  a  backward  flow  when  DiSi/rav^s  (°r  L/Ri )  is  small.  ([13]  says  a  vortex 
did  not  ‘notice’  the  existence  of  the  islands  if  L/Ri  is  small.) 
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(c)  (d) 

Figure  10:  Velocity  and  vorticity  fields  for  a  configuration  7  experiment:  (a)  just  before  the  interaction, 
t  =  19.5s;  (b)  dipole  formation,  t  =  48  s;  (c)  the  cyclonic  part  became  dominant,  t  =  102  s;  (d)  offspring 
formation,  t  =  126s. 

3.2  G  =  1.5  cm,  0.7  cm 

When  G  was  decreased,  a  different  behaviour  was  observed:  when  G  =  1.5  cm,  a  small 
dipole  still  formed  and  the  cyclonic  part  was  dominant,  and  N  =  0  or  1.  When  G  =  0.7  cm, 
a  small  portion  of  the  vortex  leaked  through  the  gaps  but  no  coherent  structure  was  formed 
(i.e.  always  N  =  0).  The  reason  why  the  vortex  generation  was  suppressed  for  G  =  0.7  cm 
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Figure  11:  A  dye  experiment  for  a  configuration  3:  (a)  just,  before  the  interaction,  t  =  7s;  (b)  dipole 
formation,  t  —  48s;  (c)  the  cyclonic  part  became  dominant,  t  —  144  s;  (d)  offspring  formation,  t  =  182s. 


Figure  12:  Ta/Tb  vs  Tb-  The  legend  represents  the  configurations  of  the  obstacles  illustrated  in  Figure 


Figure  13:  (a):  DiSi/rav,B  vs  N  where  DiSi  is  the  total  length  of  the  ‘middle5  island  (see  the  text  for 
details),  and  rav,B  is  rav  before  interaction  with  the  cylinders,  (b):  DiSi/rav,B  vs  ‘Backward  flow5.  *  are 
experiments  with  a  black  bottom  slope,  analysed  by  particle  tracking.  □  and  A  are  experiments  with  a  white 
bottom  slope  and  dye,  analysed  by  streak  and  eyes,  respectively. 


and  was  reduced  for  G  =  1.5  cm  might  be  explained  by  considering  the  thickness  of  the 
boundary  layers  (b.ls)  over  the  vertical  walls  of  the  obstacles.  (We  are  interested  in  only 
the  zonal  b.ls  since  the  flow  through  the  gaps  is  zonal.)  On  the  /-plane,  the  b.l  thickness  8 
is  expressed  as 

5  =  LEjI/El  =  , 

where  El  =  v/{VtL2)  is  the  Ekman  number  based  on  the  horizontal  length  scale  L,  Eh  = 
v/{i 1H2)  is  the  Ekman  number  based  on  the  vertical  length  scale  and  Q  is  the  rotation 
rate  of  the  system.  In  the  laboratory,  v  —  0.01  cm2s_1,  Q,  =  // 2  =  0.125s"1,  L  =  D  = 
3.3  cm,  and  H  =  ho  «  10  cm. 

=>  8  =  1.68  cm  . 

On  the  yS-plane,  two  kinds  of  zonal  Ekman  b.ls  exist,  namely 

8 zonal,  m  =  {8mL)^  ,  8zonai,s  ~  (8SL)  2  , 

where  8m  =  {v/(3 o)5  is  the  Munk  b.l,  8S  =  r/(3o  is  the  Stommel  b.l,  r  =  Sskf/i^H)  is  the 
linear  friction  coefficient,  /3q  is  the  beta  parameter,  and  S^k  =  y/2ujf  is  the  bottom  Ekman 
layer  depth.  /?o  =  sf  /H  =  0.0125  s"1  cm"1  in  the  laboratory. 


^2ona/,m  —  1.26  cm  ,  8zonai^ 


0.97  cm  . 


Therefore,  the  largest  thickness  of  these  zonal  b.ls  is  1.68  cm.  If  the  gap  width  G  between 
the  cylinders  is  much  smaller  than  3  cm,  then  the  viscous  b.ls  occupy  the  entire  region 
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within  each  gaps,  and  consequently,  the  presence  of  the  b.ls  make  the  fluid  within  the  gaps 
slow.  This  is  the  reason  why  the  number  of  offsprings,  N ,  decreased  as  G  was  reduced. 

4  Conclusions  and  further  studies 

For  G  =  3  cm,  a  dipole  was  observed  to  form  in  most  experiments  for  all  the  configurations 
of  the  islands,  however  the  formation  of  two  or  more  dipoles  never  occurred.  This  result 
invalidates  our  hypothesis.  After  a  dipole  formed,  the  cyclonic  part  of  the  dipole  became 
dominant.  Depending  on  the  configurations  of  the  obstacles  and  the  initial  vortex  position, 
a  relatively  large  offspring  was  produced  either  directly  from  the  cyclonic  part  of  the  dipole, 
or  from  the  “remnant”  of  the  original  vortex  at  the  gap  positioned  just  “South”  of  the  gap 
where  the  dipole  was  formed.  We  also  found  that  intense  vortices  experienced  relatively 
greater  amplitude  loss  than  weak  vortices,  and  the  number  of  offspring,  N,  was  1  in  general, 
independently  of  the  size  of  the  ‘middle’  islands.  Observations  of  drifters  in  the  Caribbean 
Sea  [11]  mentioned  in  §1.1  might  be  explained  from  our  experimental  results.  According 
to  [11],  large  energetic  anticyclonic  vortices  were  found  downstream  of  the  Lesser  Antilles 
and  translated  westward  while  cyclonic  vortices  were  observed  primarily  near  boundaries  in 
the  Eastern  Caribbean  Sea.  The  weak  cyclonic:  vortices  may  have  been  produced  from  the 
cyclonic  part  of  a  dipole  formed  when  a  NBC  ring  collided  with  the  islands.  It  seems  likely 
that  the  dominant  anticyclonic  offspring  was  formed  either  directly  from  the  anticyclonic 
part  of  the  dipole,  or  from  the  “remnant”  of  the  original  vortex  (after  the  dipole  formation). 
When  G  is  smaller  than  a  critical  value  (G  <  0.7 cm),  no  vortices  were  formed.  This  may 
suggest  that  for  small  enough  island  passages,  no  vortices  are  formed  in  the  ocean  due  to 
the  presence  of  b.ls  that  can  slow  the  fluid  within  the  gaps.  This  hypotheses  is  hard  to 
prove  because  the  oceanic  kinematic  viscosity  v  is  not  known  for  this  particular  process. 

In  the  present  study,  the  vortices  were  cyclonic  and  barotropic.  Moreover,  they  ap¬ 
proached  perpendicularly  to  the  chain  of  obstacles.  On  the  contrary,  in  the  ocean,  NBC 
rings  are  anticyclonic  and  baroclinic,  and  they  move  along  an  oblique  direction  to  the  line 
of  the  islands.  It  would  be  interesting  to  see  how  the  results  described  above  would  be 
modified  by  the  inclusions  of  these  details  (i.e.  anticyclonic  vortices,  baroclinicity,  direction 
of  propagation)  in  the  laboratory  experiments. 
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